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We explore the relationships between the computational problem of recognizing expander 
graphs, and the problem of efficiently approximating proof length in the well-known sys- 
tem of resolution. This program builds upon known connections between graph expansion 
and resolution lower bounds. 

A proof system P is (quasi-)automatizable if there is a search algorithm which finds a 
P-proof of a given formula f in time (quasi)polynomial in the length of a shortest P-proof 
of f. It is open whether resolution is (quasi-)automatizable. We prove several conditional 
non-automatizability results for resolution modulo new conjectures concerning the com- 
plexity of identifying bipartite expander graphs. Our reductions use a natural family of 
formulas and exploit the well-known relationships between expansion and length of res- 
olution proofs. Our hardness assumptions are unsupported; we survey known results as 
progress towards establishing their plausibility. The major contribution is a conditional 


hardness result for the quasi-automatizability of resolution. 
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Chapter 1 


Introduction 


1.1 Introduction 


1.1.1 Propositional proofs and computation complexity 


The formal study of computations has been a greatly influential and active area of math- 
ematical study in recent centuries, arising from the fundamental questions of the nature 
of computations and algorithms, and of which problems are efficiently solvable. At the 
heart of the many results and open questions of said field lies the notion that compu- 
tations may be mathematically formalized, and automated. An equally important, and 
closely related discovery is that mathematical proofs themselves can be formalized and 
automated. The novelty of applying formal reasoning to the very central activity of 
mathematicians is perhaps best demonstrated by the Incompleteness Theorem of Godel, 
perhaps the most famous result of mathematical logic. Gödel proved, in response to a 
program put forth by Hilbert to prove the consistency of arithmetic, that there exists a 
statement about the natural numbers which is true yet cannot be proven. An analogous 
result for computations followed soon after, when Turing showed the existence of natural 
problems which admit no algorithmic solution in the Undecidability Theorem. 


The particular relationship between proofs and algorithms on which this thesis is 
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based arises from the widely studied class of problems known best as NP cf. [45,77,87]. 
The class NP, informally, captures those problems which share the property that a pur- 
ported solution may be efficiently checked for correctness. The fundamental relationship 
of NP to the notion of efficient proofs is made succinct by Edmonds [56], who refers 
to problems for which every solution has a short, efficiently verifiable “proof” of its 
correctness. Cook and Reckhow [47] define a propositional proof system as an efficiently 
computable function from some language to the propositional tautologies, where f (1) = y 
may be taken to mean “zx is a proof of y”. An efficient, or polynomially bounded proof sys- 
tem is one for which |x| is at most polynomial in |y| for all proofs x and tautologies y. This 
natural definition coincides in a certain way with the fundamental NP-complete problem 
of propositional satisfiability, and its complement, the set of unsatisfiable propositional 
formulas (i.e. negations of tautologies): the existence of a polynomially bounded proof 
system is equivalent to NP being equal to its complement CoNP [48], which is widely 
conjectured to be false. Thus a fundamental program in the study of propositional proof 
systems is to prove that increasingly powerful systems are not polynomially bounded, 
towards proving that NP 4 CoNP. Only in recent decades have superpolynomial lower 
bounds on the length of proofs in certain systems begun to emerge. Even for so-called 
Frege systems, relatively weak systems which capture most natural formal reasoning for 
propositional logic, only superlinear lower bounds are known. Nonetheless, this is an 
active area of research, in which lower bounds have been shown for increasingly stronger 
systems, and whose study has lead to many results of interest beyond the fundamental 


NP Æ CoNP program. 


Another fundamental question of propositional proof systems is whether the act of 
finding proofs can be carried out efficiently. So-called automated theorem provers are of 
both theoretical and practical interest. Particular practical interests include the problem 
of automatic verification wherein we wish to prove that a boolean circuit computes a 


particular function, or computational intelligence problems wherein we wish for some 
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machine to efficiently deduce tautologies given premises describing a task or environment. 
By proving lower bounds on the length of proofs in a given system, one precludes the 
existence of an efficient automated theorem prover, since a given proof may require an 


infeasible amount of time to even write down. 


1.1.2 Resolution and efficient proof search 


Our primary concern in this thesis is with the well-studied propositional proof system 
of resolution. The resolution system is a very restricted form of a Frege system. Frege 
systems are a general sort of propositional proof system in which the lines of a proof are 
propositional formulas, and new lines are generated by previously derived lines using a 
set of schematic rules. For the special case of resolution, the lines of proofs are restricted 
to be clauses, i.e. conjunctions of literals, and there is a single rule by which new lines 
are derived, called the resolution rule. The resolution rule states that when x implies 
A and az implies B, we may infer that either A or B holds. Resolution is commonly 
used as a system of refutations, where given the negation of a formula as a conjunction 
of clauses, one derives the empty clause. 

Given these restrictions, resolution refutations admit a simple combinatorial charac- 
terization, which has led to several strong lower bounds on the length of resolution proofs, 
but also a strong understanding of which sorts of formulas are “hard” for resolution. More 
practically, resolution is closely related to backtracking algorithms for the propositional 
satisfiability problem, which effect a search over all truth assignments for one which sat- 
isfies the input formula. For several well-known algorithms, such as Davis-Putnam [51] 
and DPLL [50], a (failed) execution trace on an unsatisfiable input essentially constitutes 
a resolution refutation. Therefore if an unsatisfiable CNF formula @ requires refutations 
of exponential length, then such algorithms must run for exponential time given ¢@ as 
input. Despite such negative results however, DPLL-based algorithms constitute some of 


the most empirically efficient satisfiability algorithms, and so there is interest in under- 
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standing their complexity beyond the worst-case time lower bounds given by resolution 
lower bounds. In view this limitation of weak systems such as resolution, Bonet et al. [32] 
proposed another definition of efficiency for proof search algorithms, known as automati- 
zability. A proof system is automatizable when there is an algorithm which finds a proof 
of a given tautology ¢, which is efficient with respect to the length of the smallest proof 
$. Roughly speaking, the question of automatizability is concerned with proof search al- 
gorithms which are “efficient” if we discount the (potentially exponential) time required 
to even write down a proof, or more practically speaking, if there is an algorithm which 
is efficient in all cases where short proofs exist. It has been proven that a restricted form 
of Frege systems (which includes resolution) is not automatizable, assuming a widely- 
believed cryptographic conjecture [30,32,83]. Subsequently, Alekhnovich and Razborov 
have proven that resolution is not automatizable, assuming a plausible parameterized 
complexity conjecture [5]. However, the authors later suggest in [6] that the formulas 
used to show non-automatizability in [5] may be considered isolated and ad hoc, and 
hence raise the question of the practical relevance of their result; they proceed to suggest 
a program of identifying “natural” classes of tautologies for which their is an automatiz- 
ing algorithm. A major motivation for this thesis is to pursue the opposite direction, by 
proving the (conditional) non-automatizability of resolution for more natural formulas 


than those formulas used in the proof of [5]. 


1.1.3 Expansion and resolution lower bounds 


The simple combinatorial characterization of resolution proofs has made resolution an 
appealing area of study for proof complexity researchers. In [68], Haken introduced the 
so-called bottleneck counting method to show exponential lower bounds on the length of 
resolution refutations of the pigeonhole principle, i.e. the family of contradictory CNF 
formulas which claim there is an injective mapping from n + 1 pigeons to n holes. Some 


time later, Haken’s lower bound technique was simplified by Beame and Pitassi [21], who 
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refined the complicated counting of Haken’s method using a method of random restric- 
tions. Haken's argument shows that a resolution refutation of the pigeonhole principle 
must contain (exponentially) many “bottlenecks” in the form of complex clauses, which 
are wide in the sense that they contain many propositional literals. Beame and Pitassi, 
however, effectively simplify Haken’s argument so that they only need show that a pi- 
geonhole principle refutation contains a single complex (and wide) clause, suggesting a 
connection between refutation length, i.e. number of clauses, and width, i.e. the minimum 
number of literals in any clause. This intuition was made formal in the elegant results of 
Ben-Sasson and Wigderson [24], who show a general tradeoff between length and width 
in resolution refutations. In [24], known exponential lower bounds on length for several 
families of tautologies are reduced in a mostly uniform way to lower bounds on width. 
The second part of their result demonstrates a generalized approach to showing lower 
bounds on refutation width by using the theory of expander graphs. 

Since their introduction in the 1970’s, the term expander graph has come to encompass 
graphs with a variety of useful combinatorial properties, generally centered around the 


following informal properties: 


1. an expander graph is sparse, yet highly connected, or 


2. each set of vertices in an expander graph has many neighbors. 


Graphs of this special sort have shown to have far-reaching applications, for example 
in the study of error-correcting codes, size lower bounds for boolean circuits, and error 
reduction in randomized algorithms. The application of interest in this thesis, of course, 
is in proving lower bounds on the width in resolution refutations. In [24], and further 
expanded upon in [22], Ben-Sasson and Wigderson formalize the notion of expansion for 
sets of constraints (i.e. contradictory CNF formulas), and describe a general approach 
for showing that refutations of expanding constraints must contain a wide clause. More- 


over, many “natural” families of formulas considered in proof complexity research are 
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based on an underlying graph structure, for which the formula expansion is intimately 
related to expansion properties of the underlying graph; for example, the Tseitin tautolo- 
gies [104], graph ordering principle [65], perfect matching principles [105] or randomly 
chosen CNF formulas, cf. [90]. Our primary motivation for this thesis is to investigate 
the application of the strong connections between resolution size and graph expansion 
towards computational complexity problems for resolution, such as automatizability. In 
particular, we show that the automatization of resolution is as computationally hard as 
identifying certain sorts of expansion in bipartite graphs. These results comprise progress 
towards showing natural families of formulas for which the problem of finding resolution 


proofs is hard even in the context of automatizability. 


1.2 Overview 


As described in the preceding sections, the goal of this thesis is to further develop rela- 
tionships between expansion and proof size for the resolution system in the context of 
computational complexity, and in particular for the problem of automatizability. Our 
results constitute progress towards a strong theorem of non-automatizability for resolu- 
tion, and related systems, proved using “natural” formulas. In this section we give a 
high-level view of the organization of this thesis, and summarize our major contributions 
towards the aforementioned goals. 

In chapter 2, we discuss computational problems related to expander graphs. In sec- 
tion 2.1 we give formal definitions of expansion as are relevant to this thesis as well as 
establish some essential facts concerning expansion. We also give a brief introduction 
of parameterized complexity and approrimation (optimization) problems, but assume the 
reader to be familiar with at least undergraduate-level methods, concepts and results of 
traditional computational complexity. In section 2.2 we give a survey of computational 


problems from the existing literature which are naturally related to graph expansion, 
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with focus on known (in)approximability results and hardness results. In section 2.3 we 
introduce the sort of computational problems for expander graphs which are of primary 
interest in subsequent chapters. We present hardness results for certain expansion prob- 
lems. We then discuss the average-case hardness of bipartite expansion, and show some 
consequences for average-case hardness due to properties of randomly chosen graphs. 

In chapter 3 we give our main results, wherein we reduce graph expansion prob- 
lems to the automatization of resolution. In section 3.1 we give formal definitions of 
automatization and survey known results, both positive and negative, concerning the au- 
tomatizability of resolution. We also present the size-width tradeoffs for resolution and 
other material in preperation for proving resolution lower bounds. We also present the 
graph pigeonhole principle, which is the basis for our reductions in subsequent sections. 
In section 3.2 we give our first results reducing the “pseudo-approximation” of expansion 
to the automatizability of resolution. These results illustrate the conceptual simplicity 
of reducing expansion to proof size, using straightforward reductions. We also show the 
limitations of such straightforward reductions in proving strong non-automatizability re- 
sults. In section 3.3 we give our main results showing non-automatizability from a new 
average-case hardness assumption for expansion. We conclude with a discussion of open 


problems and further directions for research. 


1.2.1 Main contributions and results 


The main contributions of this thesis constitute progress towards two related directions 
of research. The first is to respond to the questions of Alekhnovich and Razborov [6] by 
demonstrating non-automatizability results via reductions using “natural” formulas, and 
moreover, that draw upon the well-known relationships between expansion and resolution 
size. Our Theorems 3.16 and 3.18 (section 3.2) illustrate a simple reduction relating the 
expansion of a given bipartite graph, to the length of resolution refutations of the so-called 


graph pigeonhole principle, an efficiently constructible proposition formula stating that a 
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bipartite graph cannot contain a complete matching. Theorem 3.21 (section 3.3) improves 
on this reduction to show that resolution is not quasi-automatizable (where an “efficient” 
search algorithm is allowed to run in quasipolynomial time), assuming a new conjecture 
about the average-case hardness of identifying expander graphs; to the author’s knowl- 
edge this is the first (conditional) hardness result for the quasi-automatizability of res- 
olution. The average-case hardness conjectures for bipartite expansion, as suggested by 
Buresh-Oppenheim [36], may be of independent interest in the study of complexity of 
approximation problems. Finally, in the investigation of an open problem, we present an 
NP-hardness result related to the system of pool resolution [64] which is of independent 
interest; this result is orthogonal to the main focus of this thesis, and is subsequently 


presented in Appendix A. 


1.3 Preliminaries 


1.3.1 General notation and conventions 


For a natural number n we denote by [n] the set (1,2,...,n). We denote probabilities 
by Pr[] where Pr|A, B] is the probability of events A and B occurring, and Pr[—A] is the 
probability of event A not occurring. For D a distribution and X a random variable over 
the probability space of D, we write X ~ D to indicate that X is distributed by D, to be 
read as X is selected from distribution D. We write f(n) = poly(n) to indicate that there 
exists a constant c > 0 such that f(n) < n° for n sufficiently large. Similarly, we write 
f(n) = polylog(n) to indicate that there exists constant c > 0 such that f(n) < log®n, 
for n sufficiently large. For simplicity, we may tacitly assume that expressions are integer 


valued when expected in context. 
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1.3.2 Graph notations 


For a graph G we denote the set of vertices as V(G) and the set of edges as E(G). We 
may abuse notation and write (u,v) to denote an edge {u,v} in an undirected graph. 
Throughout all graphs are assumed undirected unless explicitly stated otherwise. For 
vertices u,v € V(G) we say that u is adjacent to v to mean {u,v} € E(G); conversely, 
we say that an edge {u,v} is incident on u. An edge {u,v} is incident on a set U of 
vertices if u € U or v € U. For a graph G and vertex u € V(G) we denote by Iglu) the 
neighborhood of u in G, as the set of all e € E(G) incident on u. For a set of vertices 
U CV(G) we write P'g(U) to mean |J ey Lalu). The proper neighborhood of U C V(G) 
in G, denoted [(U), is defined as T¢(U) \ U. For a set U € V(G), define the boundary 
of U in G, denoted by OG(U), as the set of all u € T¿(U) with exactly one adjacent 
vertex in U. For a vertex u € V(G), the degree of u, denoted d(u), is defined as |Pa(u)!. 
We denote by d(G) the maximum degree over all vertices of graph G. Say that a graph 
G is d-regular if d(u) = d for all vertices u € V(G). We may omit the subscript G from 
any of the above notations when a particular graph is implied by context. 

A bipartite graph G is a graph for which there is a partition of V(G) into sets L(G) and 
R(G), called the left side and right side of G respectively, such that E(G) € L(G) x R(G). 
Let G be a bipartite graph. The left-degree (or simply degree) of G, denoted dr (G) is the 
maximum degree over all u € L(G). The right-degree, denoted da(G), is the maximum 
degree over u € R(G). Say that G is d-regular (resp. d-right-regular) if d(u) = d for all 
u € L(G) (resp. R(G)). We denote by G?” the set of all bipartite graphs with |L(G)| = m 
and |R(G)| = n. If G € G" we say that G maps m to n (equivalently, that G is from m 
to n). We denote by Km,n the complete bipartite graph from m to n, i.e. with all edges 
from L(G) to R(G) present. 

A matching in a bipartite graph G is a subset M C E(G) where for each (a,b), (c, d) € 
M both a 4 cand b # d. We may consider a matching as a 1:1 onto function M : A= B 


where A = {u : (u,v) € M} and B = {v : (u,v) € M}, in which case we say M is over 
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(or on) A, and into B. A matching is total if it is over L(G) and into R(G). A matching 
is partial if it is not total. 

For a graph G and V C V(G), the subgraph induced by V is the graph consisting of 
vertices V and all edges of E(G) incident on V. For E C E(G) the subgraph induced by 
E is the graph consisting of edges E and all vertices of V(G) on which F is incident. For 
A a set of vertices (resp. edges) of G, we write G — A to denote the subgraph induced 
by V(G) \ A (resp. E(G) \ A). For a set A = (V UE) for V C V(G) and EC E(G), we 
denote by G — A the graph (G — V) — E. For H a subgraph of G we denote by G — H 
the graph G — (V(H)U E(H)). 

Throughout this manuscript the reader may prefer to consider bipartite graphs as 
being undirected but with a fixed “orientation” of the vertex partitions. The term graph 
will refer to general (undirected) graphs unless context indicates a bipartite graph. We 
adopt the convention that vertices are identifies by integers, so that for G € G’", L(G) = 
[m] and R(G) = [n], unless explicitly specified otherwise. 

In what follows we make frequent mention of Hall’s Theorem concerning the existence 


of matchings in graphs, of which we now state a particular specialized form: 


Theorem 1.1 (Hall’s Theorem) Let G be a bipartite graph. For any S C L(G), S 
has a matching if and only if for all S E S, [P(S5| > |S". 


1.3.3 Propositional logic 


We briefly introduce necessary definitions of propositional logic. Define a (propositional) 
variable x to range over {0,1}, which correspond in the natural way to truth values 
(1, T}. A literal of a variable x is either x itself, in which case we x is a positive literal, 
or its negation =x in which case we say ~z is a negative literal. We may also denote the 


0 xl respectively, so that ~z! = x° and 72° = qt. 


positive and negative literals of x as x 
For X a set of variables, an assignment to X is a function p : X — {0,1,x}, where 


if p(x) = x we say that x is unassigned by p. If x‘ is a literal, we may write p(x‘) to 
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indicate the value z‘ takes on under p in the obvious way. We denote by supp(p) the 
set of variables assigned by p, i.e. p~'({0,1}). An assignment p is total exactly when 
supp(p) = dom(p); more generally, p is total for any set X C supp(p). We may also refer 
to assignments as restrictions. 

A clause over variable set X is a set of literals from X which we interpret as the dis- 
junction of its members. A term over X is a set of literals interpreted as the conjunction 
of its members. For a clause or term C we denote supp(C) the support of C, which is 
the set of variables for which some corresponding literal appears in C. A CNF formula, 
is a set of clauses to be interpreted as the conjunction of its members. A DNF formula 
is a set of terms interpreted as the disjunction of its members. We may refer to either 
as simply a formula when the particular sort is clear from context. For a formula f we 
denote by var(f) the union of supp(C) over all clauses (terms) C € f, and by lit(f) the 
set of all literals over var( f). We denote the empty clause as the value 0, i.e. the truth 
value of the empty disjunction. The empty term is similarly denoted by the value 1. For 
k a positive integer, a k-CNF (resp. k-DNF) formula is one for which every clause (resp. 
term) C satisfies |C] < k. 

Let X be a set of variables and p an assignment with X C dom(p). For a literal z‘ 
from X, we define x}, in the obvious way by x[, = 1 — e @ p(x), if x € supp(p), or 
x}, = x° otherwise. For a clause C over X we define CJ, by applying p to each literal 


in C and simplifying; that is, 


1, if xx], = 1 for any 2% in C 
C= í 
{xt EC: p(x) =x}, otherwise. 


For a CNF formula f over X we define f|, by applying p to each C € f so that 


fi 0, if Cl, = 0 for some C € f 
p = 
{C|,:C € f,CT, #1}, otherwise. 
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Similarly, for t a term and f a DNF formula over X, we define £|, and fl, as follows: 


ét 0, if x‘, = 0 for any xê in C 
p pal 
(1 EC: p(x) =x}, otherwise. 
1, if tlp = 1 for some t E€ f 
fiz = 


{t|,:t € f,tl, #0}, otherwise. 

We may also extend the above notation to an arbitrary function g : X — Y, so that 
glp : X \supp(p) — Y. Say that p satisfies an object A if A], = 1; conversely, say that p 
falsifies A if AJ, = 0. We may refer to CJ, as C restricted by p, or equivalently p applied 
to C. 

We may extend an assignment p by composing it with assignment p' so long as p is 
consistent with p' in the following sense: for each x € supp(p) N supp(p’), p(x) = plz’). 
Then we define po p' as (po p')(x) = p(x), if p(x) 4 x, or p'(x) otherwise. Say that 
assignment p is a sub-assignment (or sub-restriction) of p' if supp(p) C supp(p’) and p is 
consistent with p’. 

For sets A,B of boolean functions, say that A semantically implies B, denoted as 
A B, if any assignment which satisfies all f € A also satisfies each g € B. More 
generally, for a class R of assignments, say that A Er B if any p € R satisfying all 


f € A also satisfies each g € B. 


1.3.4 Propositional proof systems 


We briefly introduce the formal notion of a proposition proof system as defined by Cook 
and Reckhow [48], then proceed to define the resolution refutation system. By convention 
we consider the language TAUT to consist of all propositional tautologies represented as 
DNF formulas, and CoSAT to consist of all unsatisfiable formulas represented as CNF 


formulas. 
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A propositional proof system (or simply proof system) is a polytime computable pred- 
icate f € &* x TAUT for some proof alphabet X, with the property that p € TAUT iff 
there exists 7 € %* such that f(m, 6), in which case we say that m is an f-proof of 6. 
In this thesis we are primarily concerned with (propositional) refutation systems which 
prove the unsatisfiability of Ap € CoSAT, for $ € TAUT. 

A proof system f is polynomially bounded if there exists polynomial p such that for 
any $ € TAUT we have that f(a, 6) implies |r| < p(|¢|). For two proof systems f and g, 
say that g polynomially simulates f (or p-simulates f) if there exists a polynomial q and 
algorithm A such that, if m is an f-proof of ¢ then |4(7)| < q(|a|) and A(z) is a g-proof 
of $. Say that systems f,g are polynomially equivalent (or p-equivalent) if f p-simulates 
g and g p-simulates f. 

In many cases a proof system f is defined by describing objects which comprise the 
lines of a proof as well as a system of rules from which new lines may be derived from 
previous ones. The resolution system, defined below, has lines consisting of clauses and 


a single rule defined as follows: 


Definition 1.1 (Resolution rule) Given two clauses A, B such that x€ € A anda'~* € 


B, we may derive C = A \ xf U B\ 21“; call C the resolvent of A and B on x. 


Definition 1.2 (Resolution derivation) Let A be a set of clauses and C a clause. A 
finite sequence Tr = (C1, C2...Cp) is a resolution derivation of C from A, denoted as 
AF” C, exactly when Cy = C and for each C; € Tr, either C; € A (in which case we say 
C; is an axiom), or there exists Cj, C;, E T with jı < ja < i such that C is a resolvent 
of Cj, and C;o. 

A resolution refutation of a contradictory formula @ is a derivation of the empty 
clause 0 from @. 


We write At B to mean there exists some resolution derivation m of B from A. 


We may now formally define resolution as a system for refutations, i.e. derivations of 
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the unsatisfiable empty clause from a contradictory formula. The reader can easily verify 


that the following system is sound and complete. 


Definition 1.3 (Resolution) Define the propositional refutations system resolution by 


the predicate RES, where RES(r, 6) if and only if p F" 0. 


Definition 1.4 (Complexity measures for resolution) Let 7 be a finite sequence of 
clauses. Define the length |r| as the number of clauses in m. Define the width w(T) as 


maxce, |C|. For a contradictory formula ¢, define 


Sarlo F 0) Saef a [7] 


wrl9 F 0) =def min w(r). 


We call SrR(ọ F 0) and waló F 0) the resolution size and resolution width of ¢, respec- 


tively. 


In many applications it is convenient to consider the following combinatorial repre- 


sentation of resolution refutations: 


Definition 1.5 (Derivation graph) Let m be a derivation of clause C from A. We 
associate with m a unique directed graph Gr with labellings 0, : E(G,) — lit(A) and 
Ur : V(G,) > var(A) U {x} as follows: each verter i € V(G) is uniquely associated with 
the clause C; € T. All axioms in A are sources (leaves) in Gr and are labelled with x 
(representing “unlabelled”. If Cy, is derived from C;,C; € T by a resolution step, we add 
edges from each ofi, j to k; moreover, if x is the variable resolved upon to obtain Cy and 


x€ € Civera z, lr(i, k) = x€ and =>, 


We may refer to the graph (and labellings) associated with derivation 7 transparently 
by considering m to be represented by its labelled graph. Note that the graph of and 


derivation is directed acyclic with unique sink associated with the derived clause. 
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We also define certain well known refinements of resolution. Formally, a proof system 
gis a refinement of system f if all g-proofs are also f-proofs. Intuitively, we may consider 
a refinement of resolution to be a refutation system as resolution, but with additional 
combinatorial constraints on the form of refutations. For example, a derivation is called 
tree-like exactly when its corresponding graph is a tree. We formally define tree-like 


resolution, or simply tree resolution as follows: 


Definition 1.6 (Tree resolution) Let m be a resolution derivation of B from A. We 
write A FF B if the underlying graph of m is tree-like. Define the refutation system 
REST as resolution but with refutations constrained to be tree-like. For ¢ a contradictory 


formula, define 


PoP ean n 


wrlo F 0) =aef min w(r). 
TdF PO 


We also define so-called regular resolution, in which the graph of a refutation must 
satisfy the combinatorial property that each variable appears once on every path from 0 


to an axiom. 


Definition 1.7 (Regular resolution) Let 7 be a resolution derivation of C from for- 
mula A. Say that T is a regular derivation, denoted by A FF, C, if for every path from 
the sink to a source in G,, all edges are labelled with distinct variables. A regular refu- 
tation is a regular derivation of the empty clause 0. Define the refutation system Reg as 
resolution but restricted to regular refutations. 

For $ a contradictory formula, define 


Sreg(@F 0) =aeg_min fal 


OFF eg0 


Weg OE 0) =dep min w(n). 


reg 
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Clearly resolution p-simulates both regular and tree-like resolution. It is also well- 
known (cf. [4,23]) that neither regular nor tree-like resolution p-simulate resolution. Since 
tree-like and regular resolution are refinements of resolution, clearly Sr( 0) < Saló F 


0) and Sreg[ F 0) < Srl E 0) for all unsatisfiable ¢. 


We also define restrictions of derivations in the natural way: 


Definition 1.8 (Restricted refutation) Let r be a derivation of clause B from A and 
let p be an assignment to var(A). Define the restriction of m by p, denoted by rr|,, be the 


sequence of clauses obtained by restricting each Ci € t by p, discarding any clause C; for 


which C;¿|,= 1. 


A refutation system f is said to be closed under restrictions if applying a restriction p 
to an f-refutation 7 of A yields an f-refutation of A],. A refutation system f is natural 
if for any unsatsfiable A, S(A[, F 0) < S;(AF 0). All forms of resolution defined above 
are well-known to be natural, as well as closed under restrictions. 

Finally, we also consider the family of refutation systems Res(k) for k € N. The 
systems Res(k) generalize resolution by allowing the lines of refutations to be k-DNF 
formulas, where the resolution rule is extended to allow resolving on k-terms. We present 


below the definition from [82]. 


Definition 1.9 (Res(k) refutation) Fixk > 1. Let A be a set of k-DNF formulas and 
let C be a k-DNF formula. A Res(k) derivation of C from A is a sequence of k-DNF 
formulas 7 = (fi, f2,..., fn) such that fa = C, and for each £ € |n], either fe € A, or 
there exists F = fi,,..., fi, ET, Sk, i< € fort € [5], such that fe follows from F by 


one of the following rules: 


Subsumption : => for k-DNFs C and D 


Cv, OVA izo. ¡ Ti 
CV Niet] Ti 


Cut : CAE eV icll zi for k-DNFs Ci, Co and 2},... „z; E lit(A) 


AND — introduction : for k-DNF C and 21,...,2; € lit(A) 
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A Res(k) refutation of A is a Res(k) derivation from A of the empty clause. We write 
AFTC ifr is a Res(k) derivation of C from A. Define the refutation system Res(k) by 
the predicate RES, where RES; (rr, phi) if and only if ọ HZ 0. 


For a contradictory formula ¢, define 
SilóoF 0) = j 
rl F 0) =def in [| 
where |r| denotes the number of lines in refutation r. 


Clearly Res(1) is precisely resolution with the subsumption rule, also known as the 
weakening rule; it is therefore well-known that Res(1) is p-equivalent to resolution (cf. 
[102]). It is easy to verify that Res(k) is sound and complete as a system of refutations 


for any k > 1. Moreover, Res(k) is also strongly sound in the following sense: 


Lemma 1.2 (Lemma 1 [102]) Fix k > 1 and let 7 be a Res(k) derivation. For any 
f Em, if f is inferred from fı, f2,... fj, then for each set of terms ty € fi. ti E€ J; 


where each t; contains no negated literal of ty for i’ € |j], there exists term t € f such 


that esla = t. 


Chapter 2 


Complexity of expansion and related 


graph properties 


In this chapter we consider the computational hardness of problems related to recogniz- 
ing which graphs are expanders. Roughly speaking, an expander graph is a graph which 
is highly connected yet relatively sparse. In section 2.1 we precisely define the forms of 
expansion we will consider, as well as review the necessary complexity frameworks. In 
section 2.2 we briefly survey known results, both positive and negative, concerning the 
hardness of several problems concerning expansion-like properties of graphs. In section 
2.3 we investigate the complexity of problems concerning expansion of bipartite graphs. 
We give some basic hardness results and introduce conjectures on the hardness of iden- 
tifying bipartite expanders for use in subsequent chapters. The goal of this chapter is 
motivate the computation hardness of problems concerning bipartite expander graphs, 


and introduce related hardness conjectures for use in results of chapter 3. 


2.1 Preliminaries 


In this section we precisely define expansion and expander graphs. We also give back- 


ground material on parameterized complexity and approximation problems. The reader 


18 
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is assumed familiar with at least undergraduate-level traditional complexity theory, i.e. 
the polynomial-time hierarchy, completeness for complexity classes, and polytime (Karp, 


Turing) reductions. 


2.1.1 Expander graphs 


There are many nearly-equivalent ways to formalize expansion for graphs as described at 
the outset of this chapter. In this thesis we are concerned primarily with the application 
of expander graphs to propositional proof complexity, for which the notion of vertex 
expansion is most relevant, particularly as applied to bipartite graphs. We now fix the 


required definitions. 


Definition 2.1 (Vertex expansion) Let G be a graph (bipartite graph) and let n =4ef 
|\V(G)| (resp. n =a4ef |L(G)|). Fix parameters 0 < r < n and c > 0. Then G is (r,c)- 
expanding if for each set S C V(G) (resp. S C L(G)), |S| =r satisfies |[(S)\S| > c-|S|. 
Say that G is (r,c)-boundary expanding if for each such set S we have |0(S)\S| > e- |S]. 
Define an (r,c)-(boundary) expander as a graph G which is (r,c)-(boundary) expanding 
forallr' <r. 

We refer to r as the set size parameter and to c as the expansion parameter. We 
may say that G is (r,c)-(boundary) contracting to mean that G is not (r, c)-(boundary) 


expanding. 


Note that the parameters r, c are not required to be constants, and hence may depend 
on any properties of the graph or even each other. Common cases of interest are where r 
is a constant fraction of n and cis a constant or function of the degree. It is understood 
that we only consider the bipartite form of expansion for explicitly bipartite graphs. 
Note that we permit expanders to have expansion parameter c < 1. Observe that any 
expander must have c < d(G); an optimal expander is one for which c is arbitrarily close 


to the degree. We now state several straightforward facts about expanders. 
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Proposition 2.1 Fix graph G and expansion parameters r, C. 
1. If G contains an edge-induced subgraph which is (r,c)-expanding, then G is (r,c)- 
expanding. 
2. If G is (r,c)-boundary expanding, then G is (r, c)-expanding. 


3. If G is bipartite and (r,c)-expanding for c > d(G)/2, then G is (r,2c — d(G))- 


boundary expanding. 


Proof of Proposition 2.1 Properties 1 and 2 follow directly from definitions. To show 
property 3, fix G € G”. Fix arbitrary r < m, c > d(G)/2 and S C L(G) of size r. By 
definition of boundary expansion, each non-boundary element of T(S) has at least two 


neighbors in S. Since |[(S})| < d(G)|S|, we have 


d(G) - |S| + |A(S)] > 2P(5)] > 2c - 15] 


from which the desired result follows. 


We also consider the following optima for each expansion parameter. The following 
definitions characterize “good” expanders for applications in which one parameter is 


fixed. 


Definition 2.2 (Vertex expansion optima) Let G be a graph (bipartite graph) and 
let n = |V(G)| (n = |L(G)|). For arbitrary c > 0 define the maximum expanding size 
ralc) as the largest r < n such that G is an (r,c)-expander. Define the minimum con- 
tracting size ro(c), dual to rg(c), as the smallest r > 0 such that G is (r,c)-contracting. 
For any 1 < r < n, define the maximum expansion ca(r) as the minimum, over all 


S C V(G) (S C L(G)) of size at most r, of (T(S) \ S|/|S|. 


Clearly if rg(c) > 0 then G is an (r¢(c), c)-expander, and conversely if rafc) < n 
then G is (re(c) + 1, c)-contracting. We also define the following definitions which also 


characterize the intuitive notion of graphs which are highly connected yet sparse. 
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Definition 2.3 (Edge expansion) Let G be a graph on n vertices. For U C V(G), 
let cut(S) denote the number of edges with endpoints in each of S, V(G)X S. Define the 
edge expansion coefficient eg as the minimum over all S C V(G) of size at most n/2 of 
cut(S)/|S|. For any a € (0,1/2], define egla) as the minimum of cut(S)/|S| over all S 


such that an < |S| < (1 — a)n. 


Definition 2.4 (a-separator) Let G be a graph. For arbitrary a € (0,1], define an 
a-vertex separator (resp. a-edge separator) for G as a subset S of vertices (resp. edges) 
of G such that every connected component of Gls has size at most a-|V(G)|. Define the 


optima egla) and vala) to be the sizes of a minimal (edge or vertex) a-separator for G. 


For certain applications we will not be interested in the expansion properties of too- 
small sets of vertices. To facilitate this we may replace, in any preceding appropriate 
definition, the set size parameter r with a range [ro, rı], so that our existing definitions 
are preserved by replacing r with [1,r]. We may omit G as a subscript or argument from 


any of the notations defined above when a particular graph is clear from context. 


2.1.2 Parameterized complexity and optimization problems 


In considering the computational complexity of identifying graphs with the properties 
defined in section 2.1.1, it is appropriate to employ the framework of parameterized 
complexity. In this framework one considers so-called parameterized problems, for which 
an instance has natural parameters; for example, the CLIQUE problem, which asks if 
a graph G has a k-clique, is parameterized with parameter k and instance size |G|. 
The central question of parameterized complexity is, roughly, whether a problem has an 
efficient (polytime) algorithm for each fixed value of its parameters. This framework has 
been developed, in part, to provide more detailed analysis of parameterized problems in 
cases where traditional hardness results (e.g. for NP or PSPACE) are considered overly 


pessimistic for practical purposes, where applications may naturally restrict parameter 
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values. In this section we present simplified forms of the necessary concepts; the reader 


is directed to [54,55] for a complete discussion and further motivation. 


Definition 2.5 (Parameterized problem) A parameterized problem ¿s a language 
LCYxY for an instance alphabet © and parameter alphabet %’. For any Y C X" 


we define Ly as the set of all x such that (x,y) € L for y € Y . 


Definition 2.6 (Fixed-parameter tractability) A parameterized problem L is fixed- 
parameter tractable, and in the class FPT, if and only if there exists a recursive function 
f, global constant c and algorithm A which decides (x,y) € L in time f(|y|) - |a|° for all 


instances (x,y). 


Observe that the class FPT defined above is equivalent if we instead require that 
algorithm A runs in time f(|y|) + |x|" on any instance (x,y) [40]. We also require the 


following notion of reducibility, analogous to the definition of FPT. 


Definition 2.7 (Parameterized reduction) Let L and L' be parameterized language. 
We say that L parametrically reduces to L’, written L <y L' if there exists recursive 
function f, global constant c and algorithm A which computes from an instance (x,y) of 


L a pair (x*, y') such that 
1. (x,y) € L if and only if (2, y) € Ll’, 
2. y = gly) for g a function of only y, and 


3. A runs in time f(lyl) - |a|°. 


It is also possible to define a parameterized reduction in terms of an oracle algorithm 
which, given instance (x,y) to L only queries LẸ for some finite Y depending only on y. 
Clearly <% is transitive, and if L’ € FPT and L <, L’ then L € FPT. We now define a 


related notion of hardness under parameterized reductions. 
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Definition 2.8 (W-hierarchy) Let C be a single-output circuit over (2, A, V} where 
A, V gates may have arbitrary fan-in > 2. Say that a gate is large if it has fan-in greater 
than 2. The weft of C is the maximum number of alternations of large gates on any 
path from an input to the output. For each t > 1 define the parameterized problem 


t-NORMALIZED WEIGHTED SATISFIABILITY as follows: 


Instance: Constant-depth circuit C of weft t having n inputs, integer parameter k < n. 


Problem: Is there a input of Hamming weight k on which C outputs 1? 


For each t > 1, we define the class W|t] as the set of parameterized languages L which 
are parametrically reducible to t-NORMALIZED WEIGHTED SATISFIABILITY. Say that L 
is W[t]-hard if for all L' € W[t] we have L' <p L. A problem L is W{t|-complete when 
it is W[t]-hard and in Wft]. 

Define the class W[P] as the set of all languages parametrically reducible to the pa- 
rameterized problem of, given an (arbitrary) circuit C on n inputs and integer k < n, 
determining if C accepts some input of Hamming weight k. Define the W-hierarchy as 
the union of FPT, W[P] and all W[t] fort > 1. 


The above definitions imply the containments 


FPT C W[1] c W(2] C... < WIP] 


with FPT and W[1] analogous to P and NP respectively in the polytime hierarchy. It is 
conjectured that all of the inclusions above are proper, and thus W|1|-completeness may 
be considered strong evidence that a problem is not in FPT [53]. Observe that there 
may be multiple (natural) ways to parameterize a problem (i.e. to define which parts of 
the input are to be considered parameters). Thus inclusion or hardness results for any 
classes in the W-hierarchy only hold with respect to a particular parameterization. There 


are indeed problems which are in FPT for one parameterization, but complete for W[1] 


CHAPTER 2. COMPLEXITY OF EXPANSION AND RELATED GRAPH PROPERTIES 24 


for another (cf. [93]). We direct the reader to [53] for an extensive collection of known 
inclusion and hardness results for the W-hierarchy. 
We now give a brief introduction to optimization problems and approximation algo- 


rithms in the context of both traditional and parameterized complexity. 


Definition 2.9 (Optimization problem) Let L be a search problem and let cost(-) be 
a polytime function (cost function) taking solutions of L to Z. Define the search mini- 
mization problem (maximization problem) for (L, cost) is as follows: given an instance 
x of L, find solution y with minimal (maximal) cost. Define the decision minimization 
problem (maximization problem) for (L, cost) as, given instance x and integer k, decide 
if OPT, < k (OPT, > k), where OPT, denotes the minimum (maximum) cost cost(y) 


over all solutions to x. 


Definition 2.10 (Approximate algorithm) Let L be a search problem and cost a 
cost function for L. Let f be an arbitrary function taking instances of L to N. Say 
that an algorithm A f-approximates the search minimization (maximization) problem 
for (L, cost) if on instance x, A outputs a solution y such that cost(y) < f(x) - OPT, 
(cost(y) >1/f(x)-OPT,). Say that A f-approximates the decision minimization (maz- 


imization) problem for (L, cost) if given input (x, k}, 


A outputs 1 = > OPT, > k (resp. OPT, < k) 


A outputs 0 => OPT; f(a) <k (resp. OPT, > 1/f(x)- k) 


We call the function f(-) the approximation ratio, and note that f(x) may depend on 
OPT. 


We clarify that in the case that L is parameterized, f may depend on any parameters 
to L. Of particular interest are optimization problems which can be approximated to 


within a constant factor, or even to any constant factor. 
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Definition 2.11 (Approximable problems) Define the class APX (for “approrimable”) 
to consist of those optimization problems L such that there exists constant c and a poly- 


time algorithm A that c-approximates L. 


Definition 2.12 (Polytime approximation scheme) Let O = (L, cost) be an opti- 
mization problem (the particular sort is irrelevant). A polytime approximation scheme 
(PTAS for short) for O is an algorithm A taking as inputs (x,k) for x an instance of O 


and k € N, such that 


1. for any e > 0, the algorithm A(-,1/€) (1 +e)-approximates O, and 


2. for any constant e > 0, A(x,1/€) runs in time poly (|x|). 


An efficient polytime approximation scheme (EPTAS for short) for O is a PTAS but 


where A(x, k) runs in time f(k) -poly(|x|) for an arbitrary recursive function f : N — N. 


It is often convenient to consider an f-approximate algorithm for L as exactly comput- 
ing the solution to the f-approrimation problem for L, which is solved by any algorithm 
acheiving approximation ratio f. In this sense we may consider an EPTAS to be fixed- 


parameter tractable with respect to parameter k = 1/e. 


2.2 Complexity of edge expansion and related prop- 
erties 


In this section we survey some of the wealth of results concerning the complexity of 
problems related to expansion in definition. These problems represent the majority of 
work on expansion-related problems in the literature and serve as an introduction to the 
complexity of bipartite expansion. Certain results presented in this section also have ap- 
plications to proof-search algorithms discussed in subsequent chapters. We consider both 
(traditional) complexity and parameterized complexity, as well as (in)approximability re- 


sults. 


CHAPTER 2. COMPLEXITY OF EXPANSION AND RELATED GRAPH PROPERTIES 26 


2.2.1 General edge and vertex expansion 


Recalling the definition of edge expansion from section 2.1.1, one can think of a good edge 
expander as a graph with no bottlenecks, i.e. any (roughly even) partition of the vertices 
has high connectivity between the two parts. This property is of interest for applications 
such as algorithmic design (e.g. clustering or divide-and-conquer), multicommodity flows, 
and the theory of Markov chains. The general problem for edge expansion is nearly 
equivalent to the uniform (unweighted) version of the well-known problem of finding 


so-called sparsest cuts of graphs. We first formally define both aforementioned problems. 


Definition 2.13 (Minimum edge expansion) Define the problem MIN-EDGE Ex- 


PANSION as follows: given an undirected graph G, find S C V(G) with minimal cut(S)/|S}. 


Definition 2.14 (Minimum vertex expansion) Define the problem MIN-VERTEX Ex- 
PANSION as follows: given an undirected graph G, find S C V(G) with minimal |[(S) A 
51/15]. 


Definition 2.15 (Sparsest cut) Define the problem (unweighted) UNIFORM SPARS- 
EST CUT as follows: given an undirected graph G, find a sparsest cut, i.e. a subset 


SCV(G) with |S| < |V(G)|/2 which minimizes NET: 


Define the (unweighted) SPARSEST CUT problem as follows: given a graph G and a 
subset D C V(G)? of demand pairs, find a sparsest cut with respect to D. That is, for a 
cut S CV(G) (181 < |V(G)|/2), define dem(S) as the number of pairs (vı, v2) € D such 


that vı € S and va € S. Then a sparsest cut for D is a cut S which minimizes Pas 


Note that for any G, the minimum edge expansion eg is within a constant factor of 
2 from the optimal size of a uniform sparsest cut. Both forms of SPARSEST CUT and 
MIN-EDGE EXPANSION are known to be NP-hard [75,86]. However, the hardness of ap- 
proximating each problem is still mostly open, and in particular it is unknown whether 


either problem belongs to APX. There are several algorithms known for approximating 
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(UNIFORM) SPARSEST CUT; by our above observation, an f-approximate algorithm for 
UNIFORM SPARSEST CUT yields an O(f)-approximate algorithm for MIN-EDGE EXPAN- 
SION. In [85] it was shown that, using linear programming techniques, that SPARSEST 
CuT admits an O(log |D|)-approximate algorithm for D the set of demand pairs. This 
implies a O(log n)-approximate algorithm for UNIFORM SPARSEST CUT, and hence MIN- 
EDGE EXPANSION. Subsequently, using a semidefinite programming relaxation, Arora 
et al. [13] exhibited an O(ylog n)-approximate algorithm for UNIFORM SPARSEST CUT. 
Klein et al. [81] show that UNIFORM SPARSEST CUT is in fact in APX when restricted 
to graphs excluding any fixed graph as a minor (e.g. planar graphs). 

The authors know of no hardness of approximability results for UNIFORM SPARSEST 
CuT nor MIN-EDGE EXPANSION at the time of writing. Some conditonal hardness re- 
sults for SPARSEST CUT have been proven modulo the so-called unique games conjecture 
of Khot [78]. The unique games conjecture (or UGC) states, roughly, that NP can be 
characterized by unique games, a sort of constraint satisfiability problem formulated as 
a 2-player game (the details are irrelevant to this discussion; the interested reader is 


referred to [78,80]). Chawla et al. [44] show the following: 


Theorem 2.2 Suppose the UGC holds. Then for any constant c > 0, there is no polytime 


algorithm which c-approximates SPARSEST CUT, unless P = NP. 


In fact with a stronger assumption it is proven that no polytime algorithm ((/log log n)- 
approximates SPARSEST CUT modulo P 4 NP. Nonetheless Theorem 2.2 is sufficient to 
show that SPARSEST CUT is not in APX. 

For general graphs, there are well-known relationships between (vertex,edge) expan- 


sion optima, and the spectral parameter Aa, defined as follows: 


Definition 2.16 (Adjacency matrix and spectra) The adjacency matrix Ag of a 


graph G on n vertices is ann x n matriz with Acli, j) = 1 iff {i,j} € E(G). The 
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bipartite adjacency matrix of a bipartite graph G € Gy} is ann x m matrix Be where 
Beli, j) =1 iff (i,j) € E(G). 

For a (general) graph G define the Laplacian matrix (or normalized adjacency ma- 
trix) Qa as diag(d(v))vev(aj— Ac. Define the spectral parameter Ag as the second largest 


eigenvalue of Qa. 


An algorithm approximating Ag yields an approximate solution to the decision forms 
of MIN-EDGE EXPANSION and MIN-VERTEX EXPANSION. We first summarize results 
from [7-9,74] which relate Ag to edge and vertex expansion parameters, in the following 


theorem: 


Theorem 2.3 LetG be a (non-bipartite) graph on n vertices. Then each of the following 
hold: 


Thus determining bounds on Aa yield bounds on both cg and rg. Goldreich and 
Ron [67] demonstrate a randomized algorithm which, given a d-regular graph G and 
parameter A, accepts with high probability if Ag < A. Furthermore, they conjecture that 
said algorithm also rejects, with high probability, any graph G which is e-far from some 
G’ with Ag > A (two d-regular graphs G,G” on n vertices are e-far if they disagree on 
at least an e-fraction of edges). Czumaj and Sohler [49] have subsequently shown that, 


using appropriate parameters to the algorithm of [67], one may obtain the following: 


Theorem 2.4 For any integer d > 0, there exists a randomized algorithm A taking as 
inputs a d-regular graph GŒ, and parameters e, a. € (0,1), which behaves as follows: given 


G on n vertices and e, a, 


e A accepts with probability at least 2/3 if G is an (n/2, 0) -expander, and 
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e A rejects with probability at least 2/3 if G is at least e-far from any (n/2, a!)- 


expander, where a! = O(a?/(d? log(n/e))). 
where A runs in time at most O(d*,/n log(n/e)/(a7e?)). 


We observe that the algorithm of [67], and hence the algorithm of [49] above require 
that AG < 1 and hence are a priori not applicable to bipartite graphs (for which Ag = 1). 
However, for regular graphs, Aq is related to the biparite expansion of the so-called double 


cover of G as follows: 


Definition 2.17 (Double cover) Let G be a graph on n vertices. Define the double 


cover of G, denoted Hg, as the bipartite graph H € G} such that By = Ac. 


Theorem 2.5 (Theorem 2.1 [9]) Let G be ad-regular graph on n vertices and suppose 
that Ma > e. Then Hg is (r,c)-expanding for allr < n/2 and c = |1 + a (1 —£)]. In 


2e 
d+2e 


particular, Hg is an (n/2, (1 + ))-expander. 


Hence for left-regular, balanced bipartite graphs one may obtain a lower bound on 


ca(n/2) by bounding Ac where G is the double-cover of G”. 


2.2.2 Vertex and edge separators 


Closely related to the problem of minimum edge expansion is the problem of finding small 
separators in a graph. Recalling the definition of a-separators in section 2.1.1, this is the 
problem of finding a small set of vertices (edges) such that all connected components left 
by removing the separator are small. Finding small separators efficiently has applications 
in areas such as VLSI design, and solving other graph or flow problems (e.g. SPARSEST 


CUT). We now formally define the problems of interest for this section. 


Definition 2.18 (a-Balanced Separator) For anya € (0,1), define the a~ BALANCED 


SEPARATOR (a-BALANCED CUT) problem as follows: given a graph G on n vertices and 
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integer parameter k < n, determine if there exists a set |S| of vertices (edges), |S| < k, 


such that each component of Gls has at most an vertices? 


Definition 2.19 (Minimum a-Balanced Separator) For any a € (0,1), define the 
MIN a-BALANCED SEPARATOR (MIN a-BALANCED CUT) problem as follows: given a 
graph G on n vertices, find a minimal set of vertices (edges) such that each component 


of Gig has at most an vertices. 


Note that the above problems may also be denoted as (MIN) a- VERTEX SEPARATOR 
or (MIN) a-EDGE SEPARATOR. Both MIN a-BALANCED SEPARATOR and MIN a- 
BALANCED CUT are NP-hard, as are the corresponding parameterized problems. In 
fact, at the time of writing, no better algorithm for a- BALANCED SEPARATOR is known 
than the naive polyn(;)-time algorithm which tries all sets of k vertices, which may 
give indication that finding separators is not fixed-parameter tractable. It is proven 
in [89] that a7 BALANCED SEPARATOR is W[1]-hard (for all a > 1/2) by reduction from 
the W{1]-complete problem CLIQUE; furthermore, in [41] it is shown that a-BALANCED 
SEPARATOR is contained in W[P]. This constitutes strong evidence that a- BALANCED 
SEPARATOR is not fixed-parameter tractable. 

In light of the above hardness results, there is some progress towards efficient or fixed- 
parameter tractable approximation algorithms. We first take note of a result of [35], which 
states that an f(n)-approximate algorithm for MIN a-BALANCED SEPARATOR implies 
an f(poly(n))-approximate algorithm for MIN a-BALANCED CUT. We first present the 


following hardness of approximation results from [35]: 


Theorem 2.6 For any a < 1/2 and e > 0, there is no polytime algorithm which f- 
approximates MIN a-BALANCED SEPARATOR nor MIN a-BALANCED CUT, for f(G) = 
(1+|V|'/?-*/OPTg). This holds even if either problem is restricted to graphs with maz- 


imum degree 3. 
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Theorem 2.7 For any a < 1/2 and e > 0, there is no polytime algorithm which f- 


approximates MIN a-BALANCED CUT for f(G) = (1 + |V|*/OPTG). 


Note that Theorems 2.6, 2.7 can be considered as stating that no algorithm can achieve 
an absolute error guarantee below n*/2-* (n2-5). On the positive side, an O(log? n)- 
approximate algorithm for 1/2-BALANCED CUT, also known as MIN-BISECTION, is demon- 
strated in [61]. Further positive results provide so-called pseudo-approrimate algorithms, 
for which the solution is an a’‘-separator for a’ ~ a. The following theorem gives an 
improved approximation by allowing the balance to differ by a small constant; the result 


below from [62] improves on a previous algorithm from [60]. 


Theorem 2.8 (Theorem 6.2 [62]) For all constants a € (0,1) and e > 0, there is a 
randomized polytime algorithm which, given a graph which has an a-vertex separator of 


size k, finds an (a + €)-vertex separator of size at most ky/log k. 


A similar result is also given in [13] for edge separators, where the following PTAS-like 


deterministic algorithm is demonstrated: 


Theorem 2.9 (Corollary 2 [13]) For all a € [2/3,1) and e > 0, there is a determin- 
istic algorithm which, given a graph on n vertices which contains an a-edge separator of 


size k, finds an (a + e)-edge separator of size k in time poly(n) - 20/9, 


The above algorithms are of particular interest in that they imply good approximate 
(rather than pseudo-approximate) algorithms for graphs which contain an a-separator 
of size k, but no (a + e)-separators of size close to k for constant e > 0. Feige et al. 
conjecture in [62] that graphs which violate the aforementioned property characterize 


the hardest instances of a7 BALANCED SEPARATOR and a-BALANCED CUT. 


2.2.3 Tree width and branch width 


In this section we survey results related to the so-called tree width parameter of graphs. 


The tree width of a graph is closely related to the size of minimal separators, as well as 
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the complexity of certain algorithms for graph problems, particularly divide-and-conquer 
approaches (c.f. [26,99]). We first present the definition of tree decompositions on which 


tree width is based. 


Definition 2.20 (Tree Decomposition) A tree decomposition of (general) graph G is 
a tree T = (I, F) and family of subsets x = {X;:1 € I} with each X; C V(G),i € V(T), 


such that: 
1. Vier Xi = V(G), 
2. for all (u,v) € E(G) there exists i € I such that u,v € X;, and 
3. for alli, j,k € I, if j lies on the path from i to k in T then Xi N X; C X;. 


The width of a tree decomposition is max;c;|X;| — 1. The tree width wp(G) is the 


minimum width of any tree decomposition of G. 


We also mention the related property of branch width [98]; for the purposes of this 
thesis we only need know that branch width is within a small constant factor of tree 
width [27]. Tree width may be intuitively considered as a measure of connectivity, and 
in particular, as a measure of how much a given graph “resembles” a tree, so that graphs 
of low tree width may be considered more “tree-like”. In [28] it is shown, in particular, 
that any graph has a 1/2-vertex separator of size at most wr(G) + 1; further connec- 
tions are also established between tree width and the so-called separator number, which 
bounds the size of 1/2-vertex separators in all subgraphs of G. The problem of determin- 
ing the tree width of a given (general) graph is proven NP-complete in [12]; moreover, 
the problem remains NP-complete for bipartite graphs. Moreover in [28] the following 


inapproximability result is shown: 


Theorem 2.10 (Theorem 23 [28]) If P 4 NP, then there is no polynomial time ap- 


proximate algorithm computing a tree decomposition T(G) of a graph G which guarantees 
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w(T(G)) — wr(G) < n*, for e a constant in (0,1), n = |V(G)| and w(T(G)) the width 


of the tree decomposition. 


Conversely, [28] presents a polynomial time algorithm (in n = |V(G)| which finds a 
tree decomposition of G of width at most O(wr(G) log n). However, tree width approxi- 
mations are more often studied in the context of parameterized complexity, with wr(G) 
as the parameter of instance G. It was shown in [100], by a non-constructive argument, 
that there is a polytime algorithm to decide if a graph has w7(G) < k for constant k. 
Following this result several approximation algorithms for computing a minimal-width 
tree decomposition have been developed which run in time 2°“7(@)) . poly(|V(G)]) [11]. 
In particular, the results of [11] show that approximating the treewidth of a graph to a 
small constant factor (approximately 4) is fixed-parameter tractable. 

We also note the recent results of [6], in which a time poly(n) - 20/7) algorithm for 
computing the branch decomposition of the hypergraph G corresponding to a given CNF 
formula, is used as the basis for a resolution proof search algorithm. The decomposition 
algorithm is also based on algorithms of [98,100]. We shall discuss the algorithm of [6] 


in the subsequent section 3.1.2. 


2.3 Complexity of bipartite expansion 


In this section we investigate the complexity of the problem of determining (vertex) 
expansion in bipartite graphs (formal definitions to follow). We show some basic hardness 


results, and offer further conjectures on the hardness of determining bipartite expansion. 


2.3.1 Basic definition and results 


We begin by fixing a definition for the bipartite expansion problem informally described in 
the introduction to this section. Considering the definition of (bipartite) expansion, there 


are several possible parameterizations. We begin with the most general parameterization. 
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Definition 2.21 (Exact k, /-Contracting Set) Define the problem BIPARTITE EX- 
ACT CONTRACTING SET as follows: given bipartite G E€ Gi” and integer parameters 
k <m, € < n, determine if G contains an exact k,/l-contracting set; that is, a set 
S C L(G), |S| =k with |[(S)| < £. We abbreviate BIPARTITE EXACT CONTRACTING 


SET as BXCS. 


Clearly G contains a k, ¢-contracting set if and only if G is not (k, £/k)-expanding. 
Similarly, the following problems are the complements of determining if G is an expander 


or maximal expander, respectively. 


Definition 2.22 (k, /-Contracting Set) Define the problem BIPARTITE CONTRACT- 
ING SET as follows: given bipartite G € G? and integer parameters k < m, L < n, 
determine if G contains a k, ¢-contracting set; that is, a set S C L(G), |S| < k with 


IT(S)| < 0/k-|S|. We abbreviate BIPARTITE CONTRACTING SET as BCS. 


Definition 2.23 (Minimal k, -Contracting Set) Define the problem BIPARTITE MIN- 
IMAL CONTRACTING SET as follows: given G E G;” and integer parameters k < m, 
L< n, determine if G contains a minimal k, ¢-contracting set, i.e. determine if G con- 
tains a k, l-contracting set and for all S C L(G), if |S| < k then |['(S)| < 4/k - |S|. We 


abbreviate BIPARTITE MINIMAL CONTRACTING SET as BMCS. 


Thus G has an k, l-contracting set exactly when G is (k, ¢/k)-expanding, and G has 
a minimal k, ¢-contracting set if and only if ra(0/k) = k. We also consider the following 


parameterizations in which the expansion coefficient is kept fixed: 


Definition 2.24 ((Minimal,Exact) 6-Contracting Set) Let ô > 0 be arbitrary. De- 
fine the (parameterized) problem BIPARTITE EXACT 6-CONTRACTING SET as follows: 
given bipartite G € G? and integer parameter k < m, determine if G contains a set 
S C L(G), |S| = k such that |[(S)| < 9|S|. We abbreviate BIPARTITE EXACT ð- 


CONTRACTING SET as 6BXCS. 
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Define problem BIPARTITE 9-CONTRACTING SET as follows: given G E GI” and 
integer parameter k < m, determine if G has a set S C L(G), |S| < k where |['(S)| < 
9|5|. We abbreviate BIPARTITE 6-CONTRACTING SET as BCS. 

Define problem BIPARTITE MINIMAL -CONTRACTING SET as follows: given G € 
G? and integer parameter k < m, determine if G has a set S C L(G), |S| = k where 


IT(S)| < d|S|, but for all S$ C L(G), |S'| < k implies \[T(S’)| > 6|S’|.We abbreviate 


BIPARTITE MINIMAL 6-CONTRACTING SET as BMCS. 


Clearly for any G € G” and k < m, (G, k} is in BMCS (resp. BCS) if and only if G 
is not (k, 9)-expanding (resp. not a (k,d)-expander). Similarly, (G, k) is in MINIMAL 6- 
CONTRACTING SET exactly when ra(9) = k. One might further generalize the definition 
of BIPARTITE 6-CONTRACTING SET to allow 6 to depend on the input graph in some 
way (e.g. as a function of the degree). Conversely, we might also consider restricted forms 
of BIPARTITE 0-CONTRACTING SET where the set size depends in a particular way on 
the input graph, e.g. where the set size is am or m® for rational parameter a € (0,1). 

We first give simple membership results and relationships between the aforemen- 
tioned problems. Both BXCS,BCS are in NP for either parameterization. Further- 
more both BCS and BCS are both polytime reducible and parameterized-reducible to 
BXCS,0BXCS respectively. However, both parameterizations of BMCS are in the class 
DP? consisting of all languages expressed as Ly N Lı for Lo € NP and Lı € CoNP; in 
particular, BMCS = BCS N BCS“. The following naive algorithm decides both param- 
eterizations of BXCS: for each S C L(G) of size k, compute |D(S)| and accept if S is 


an exact k, l-contracting set (or k, 6k-contracting set for 9$BXCS). The naive algorithm 


m 


7) O(n) for G € GW} and size parameter k; to the knowledge of 


above takes time ( 
the authors, no algorithm achieves an better asymptotic time bound. The upper bound 
(7) -O(n) is approximately n*+1/k! and hence is neither polynomial nor fixed-parameter 
tractable. We conjecture, informally, that all forms of the ‘contracting set’ problem 


defined above are hard in both traditional and parameterized complexity. In the remain- 
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der of this section we investigate what negative results may be proven for any of the 
aforementioned problems. 

It is well-known that determining if a (balanced) bipartite graph G € G? is an (n/2, c)- 
expander is CoNP-complete for any constant c. In particular, the following special case 


of 1IBCSC was shown to be CoNP-complete in [25]: 


Definition 2.25 (Matcher) Define the problem MATCHER as follows: given a bipartite 


graph G € G? 


n? 


is it the case that for all S C L(G), if |S| < n/2 then there exists some 


T C R(G) such that |S| = |T| and there is a matching from |S| to |T|? 


Clearly the complement of MATCHER is equivalent to 1BCS with parameter k = 
|L(G)|/2 by Hall’s theorem. The MATCHER problem is shown to be CoNP-hard by 
reduction from a special case of CLIQUE which asks if a given graph has a clique on 
exactly half of the vertices. A slight modification of the reduction of [25] yields the 


following: 
Theorem 2.11 For any integer ô > 0, both BMCS and BCS are NP-hard. 


A complete proof of Theorem 2.11 is given in Appendix B. The following is then 


immediate from Theorem 2.11 and basic reductions. 


Corollary 2.12 For any integer 6 > 0, each of BCS, dBXCS and BMCS are NP-hard. 
Furthermore, each of BCS and BXCS and BMCS are also NP-hard. 


We may further modify the reduction of Theorem 2.11 to obtain the following result 


for ô < 1, which is of particular relevance in later sections. 
Theorem 2.13 For any 6 € (0,1), 6BCS is NP-hard. 


The proof of Theorem 2.13 is similar to that of Theorem 2.13, and appears in Ap- 


pendix B. Observe that Corollary 2.12 is a relatively weak result for BMCS and 
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BMCS, both of which belong to DP at the second level of the so-called boolean hier- 
archy (cf. [39,106]). By results from [73], if we could show BMCS to be complete for 
D”, then BMCS is not in NP unless the polynomial hierarchy collapses to the third level. 
However, we are unable to show that BMCS is hard for DP. 

The NP-hardness results above should be considered only a first step in characterizing 
the hardness of deciding our contracting set problems, and in fact we require much 
stronger hardness assumptions for our results in subsequent sections. Nonetheless, we 
proceed to consider the parameterized complexity of identifying contracting sets. As 
previously mentioned, the authors know of no better algorithm for finding contracting 
sets than the naive (7) - O(n) exhaustive search, which is clearly not an FPT algorithm. 
Since CLIQUE is W/(1|-complete [54], one might hope to show, in light of Theorem 
2.11, that BCS is W[1]-hard for at least some ranges of ô. However, the reductions of 
Theorems 2.11, 2.13 are not a parameterized reductions, as in both cases the parameter of 
¿BCS depends on the size of the CLIQUE instance. We can, however, show the following 


weaker result: 


Theorem 2.14 BXCS is W![1]-hard. 


Proof of Theorem 2.14 By parameterized reduction from CLIQUE. Let (G, k} be an 
instance of CLIQUE, and let n =gef |V(G)|, m =aer |E(G)|. Define bipartite graph H 
as follows: let L(H) = {ze : e € E(G)}, and let R(H) = {y : v € V(G)}. Connect each 
xe € L(H) to the vertices yu, Yy corresponding to the endpoints of e. Set K =der (5) and 
k' =4ef k + 1 and form instance (H, K, k') of BCS. Clearly H may be formed in time 
polynomial in |G|, and both parameters K and k' do not depend on G, hence we have 
defined a parameterized reduction. We now show that G has a k-clique if and only if H 
has a K, k'-contracting set. 

Suppose first that G contains a clique C on k vertices. Let S be the corresponding 


set of (5) edges between vertices of C in G, and let S = {ze : e € S}. Then |S"| = K, 
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and T(S’) = C hence |[y(S’)| = k < k’ as desired. 
Suppose then that H has a K,k’-contracting set S, and let S’ be the set of corre- 
sponding edges of G. Then T'y(S) corresponds to the set V’ of all vertices of G with 


both enpoints in S’, so |S’| > (AN, Then by assumption that [Dy(S)| < k’, we have 


that [Dy(S)| = k so V” is a k-clique in G. 


We are unable to prove a parameterized hardness result for the inexact or minimal 
versions of BCS. However, we conjecture that both BCS and BMCS are hard for at least 


wi]. 


2.3.2 Approximation and pseudo-approximation problems 


In this section we consider the follow optimization problem: 


Definition 2.26 (Minimal c-contracting set) For any c > 0, define the optimization 


problem MINIMAL BIPARTITE c-CONTRACTING SET (abbreviated MINcBCS) as follows: 
Instance: Bipartite graph G € GI”. 
Solution: a subset S C L(G) such that |T (S)| < c|S]. 


Objective function: the contracting set size |S]. 


Clearly an optimal solution S of MINcBCS for G has |S| = rí(c), as defined in 
section 2.1.1. Our main results in subsequent chapters concerning the automatizability of 
resolution require strong hardness assumptions on algorithms approximating MINcBCS. 
In particular, our main results depend on the hardness of approximating the decision 
form of MINcBCS to within even a superconstant ratio for certain classes of bipartite 
graphs. In this section we introduce such hardness assumptions and show how sufficient 


hardness results may be obtained from conjectures concerning average-case complexity. 
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We first observe that the corresponding (parameterized) decision problem for MINcBCS 
is cBCS, which we conjecture to be W[1]-hard. By Lemma 6 of [42], we immediately ob- 


tain the following: 
Theorem 2.15 For any c > 0, if MINcBCS admits an EPTAS, then cBCS € FPT. 


Thus if cBCS is W[1]-hard we have strong evidence that MINcBCS has no EPTAS. 
However as previously mentioned, our main results will require much stronger hardness 
assumptions than ruling out an EPTAS for MINcBCS. To this end we define the following 


notion of pseudo-approximation for MINcBCS: 


Definition 2.27 (Pseudo-approximation of MINcBCS) Fiz arbitrary parameter c > 
0. Let f : N — N and constant e > 0 be arbitrary. Say that an algorithm A is an (f,€)- 


pseudo-approximate algorithm for r¢(c) if, given instance G € GI” and k < m, 
e ifralc) < k then A outputs 1, and 


e ifralc+e) > f(m) - k then A outputs 0. 


The authors are not aware of any algorithm that pseudo-approximates (nor approx- 
imates) MINcBCS for any c, beyond the naive (exact) algorithm for BCS taking time 
proportional to (7): It turns out that certain strong hardness assumptions for pseudo- 


approximation of MINcBCS are sufficient to prove our main results in following chapters. 


2.3.3 Random graphs and average-case hardness 


In this section we discuss problems related to bipartite expansion of randomly chosen 
graphs. The study of random graphs originated in the work of Erdés and Rényi [57], and 
has since given rise to a wide body of work, in particular with applications to algorithmic 
design and average-case complexity analysis (c.f. [10, 29,76, 103]). We proceed by fixing 


some well-known distributions on graphs. 


CHAPTER 2. COMPLEXITY OF EXPANSION AND RELATED GRAPH PROPERTIES 40 


Definition 2.28 For any integer n > 1 and parameters p(n), 0 < p(n) < 1, define the 
distribution Bn pin) over Gr as follows: independently for each e € L(G) x R(G), choose 


e to be an edge of G with probability p(n). 


Definition 2.29 For any positive integers n,m and d < n, define distribution Bf, „ over 
d-regular graphs in G® as follows: Let L(G) = |m] and R(G) = [n]. Independently for 


each u € L(G), uniformly choose a d-element subset of R(G) as T(u). 


It is well-known that, for appropriate choices of parameters, random graphs drawn 
from By, p(n) and Bi n are likely to be good expanders (where the actual meaning of “good 
expander” depends on the particular application). As an example, we have the following 


results from [16, 22]. 


Proposition 2.16 (Lemma 2 [16]) Fiz n > 1 and m = cn for some c > 0. Let 
p = (Sum) and c = ®©. Then G ~ Bap is a (5, €) -boundary expander with probability 


at least 1/2. 


Proposition 2.17 (Lemma 2.23 [22]) For any constants d > 3, A > 1 and0<e< 


1/2, there exists constant k(d, e) such that for sufficiently large n, G chosen from Binn 


1+ 


is an (an, (1 + €))-expander with high probability for a = k(d, €) - A7 T=. 


In this section we will be concerned with problems, roughly speaking, of correctly 
identifying with high probability when a randomly chosen input graph is an expander. 
More formally, we consider probabilistic promise problems where inputs are “promised” 
to come from two classes of distributions, and an algorithms task is to identify which 


class a given instance is drawn from: 


Definition 2.30 (Probabilistic promise problem (PPP)) Let 2 be a finite alpha- 


bet. A probabilistic promise problem is a pair of sequences TI = (119, 111) where each TIS 
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is a set of distributions over X”. Let A be an algorithm for II. Define the soundness 


saln) of A as 


s(n) =des ma TRUET, 


and the completeness c4(n) of A as 


e(n) =def ai EEA) Mi 


Say that A solves II with success probability f(n) if for every n, c(n) — s(n) > f(n). 


We may refer to II? as NO-instances and T} as YES-instances. As a motivating 
example, we give a probabilistic promise problem due to Alekhnovich [3] based on the 
optimization problem of finding a maximum-Hamming weight satisfying subsystem to a 


set of linear equations over GF». 


Definition 2.31 (Average 3LIN) For A an m by n matrix over {0,1} and k € [m], 
define distribution D(A) as the distribution of the vector Ax + e where x € {0,1}” is 
chosen uniformly, and e € {0,1}” is independently chosen uniformly over all vectors of 
Hamming weight k. 

Define the Average 3LIN problem as follows for inputs m,n and e > 0: T} is the 
set of random pairs (A,,,b,) where A, is a random matrix {0,1} for which every row 
is the set of random pairs (An, Cn) where An is 


contains 3 ones, and by ~ Djen) (An). US 


n 


a random matriz {0,1} for which every row contains 3 ones, and Cn  Dreny+1[An)- 


Hardness conjectures for probabilistic promise problems are of particular use for prov- 
ing hardness for (deterministic) approximation problems (c.f. [3,66,91]). Roughly speak- 
ing, this is because the random nature of of YES or NO instances imparts additional 
structure. For a concrete example we present a hypothesis of Feige [91] concerning the 


problem of refuting the satisfiability of random 3-CNF formulas. We first fix a family 


d 


of distribution on CNF formulas, closely related to the distribution Bh, n on d-regular 


bipartite graphs. 
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Definition 2.32 For any positive integers n,m and constant d < n, define the distribu- 


tion Fr, ,,, over d-CNF formulas with m clauses over n variables, as follows: to choose 
formula f = {Ci,...,Cm} over variables {x; : j € [n]}, choose G according to distribu- 


tion Ba Assume R(G) = [2n], and for I C R(G) let£(1) = {a6 : i € 1,k= |1/2],€ =: 


m,2n* 


mod 2). Then for each i € [m], set Ci =L(T(1)). 


The distribution F’ n has applications in the theoretical and empirical analysis of SAT 
algorithms. A central question therein is, roughly, for what values of m (as a function 
of n) can an algorithm efficiently determine the satisfiability of formulas chosen from 
Fé n? Consider the case where m = An for constant A > 0. A well-known conjecture 
(cf. [34, 43]) states that, for each constant d > 0 there exists a constant satisfiability 
threshold Ag such that, for any constant A and sufficiently large n, f randomly chosen 
from Få, n is most likely unsatisfiable if A > Ay, and is most likely satisfiable if A < Ad. 
In fact it is proven in [63] that for d and n there exists such a threshold Ag(n), such that 
for any A and e > 0, formulas chosen from FA,,, are nearly certainly unsatisfiable if 
A > Ag(n)+e, and nearly certainly satisfiable when A < Ag(n) —e. It is shown in [1,71] 
that for each d,n, Ag(n) is bounded within a constant range, e.g. 3 < Ag(n) < 4.6, 


though it remains open whether Ag(n) converges to a constant. 


Definition 2.33 (Refute-3SAT) Define the Refute-3SAT problem as follows for pa- 
rameters A,e > 0 and input n: The YES-instances are 3-CNF formulas y of An clauses 
on n variables for which there is an assignment satisfying at least (1—ej An clauses. The 


NO-instances are random 3-CNF formulas p ~ Fann: 


Note that as no distribution is specified for YES-instances of Refute-3SAT, an algo- 
rithm for Refute-3SAT must always output 1 for YES-instances (and output 0 w.h.p. for 
NO-instances). Assuming the above conjecture that there exists a satisfiability threshold 


Aa for each constant d, it appears natural to conjecture that determining the satisfiability 
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of formulas from Få „n is a hard problem. Feige [91] formalizes this notion in what we 


will refer to as Feige's Hypothesis (abbreviated as FH). 


Conjecture 2.17.1 (Hypothesis 2 [91]) For any constant e > 0 and sufficiently large 
constant A, there is no polytime algorithm A for Refute-3SAT for parameters A,e with 


success greater than 1 — o(1). 


Definition 2.34 (R3SAT-hardness) Say that a problem L is random 3SAT-hard (ab- 
breviated as R3SAT-hard) if the existence of a polytime algorithm for L contradicts Con- 


jecture 2.17.1. 


Some remarks are neccesary to clarify the meaning of “sufficiently large”. The in- 
tended meaning is that A is a constant at least large enough so that random formulas 
from FA, are most likely unsatisfiable. In particular, it is easily shown that for suffi- 
ciently large A, every assignment to a random y ~ F ee satisfies roughly a 7/8 fraction 
of clauses. Thus Conjecture 2.17.1 may be considered, intuitively, as the hypothesis that 
it is hard on average to distinguish (1 — e)-satisfiable formulas from (the typical case 
of) (7/8 + e)-satisfiable formulas. In the realm of worst-case complexity, this informal 
inapproximability result is a well-known consequence of the PCP theorem (c.f. [14]). 
However, in [91], Conjecture 2.17.1 is used to obtain several new worst-case inapprox- 
imability results for several combinatorial problems, using reductions which rely on the 
additional structure of random 3CNF formulas. 

We now turn to the topic of bipartite expansion. In particular, we consider the 


following PPP in the vein of Refute-3S AT: 


Definition 2.35 (Average MINcBCS) Define the Average Regular-MINcBCS problem 
for parameters c > 0, f(-) and d > 0 and inputs m,n as follows: The YES-instances 
consist of d-regular bipartite graphs G € G® such that ro(c) < f(n). The NO-instances 


are randomly chosen graphs G ~ B? es 
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Define the Average MINcBCS problem for parameters c > 0, f(-), pC) € (0,1) and 
input n as follows: The YES-instances are bipartite graphs G € G? such that r(e) < 


f(n). The NO-instances are randomly chosen graphs G ~ Bn p(n): 


Thus algorithms for Average MINcBCS must correctly identify all graphs which are 
not (f(n),c)-expanding, but output 0 w.h.p. for randomly chosen graphs (which will 
likely be good expanders for appropriate choices of parameters). We now briefly consider 
the computational complexity of Average (Regular-)MINcBCS. We first observe that, 
if a randomly chosen graph is a YES-instance with probability q(n) then the optimal 
success of any algorithm is 1 — q(n). Thus to consider the hardness of solving Average 
(Regular-)MINcBCS with some non-trivial success, e.g. some constant s > 1/2, we 
restrict our attention to parameters so that a random graph is a YES-instance with low 
probability. We may fix examples for this discussion based on Propositions 2.17 and 2.16: 
we consider instances of Average Regular-MINcBCS where n = Am for constants A > 1, 
c < land f(n) = on for a’ less than the constant a of Proposition 2.17, and instances 
of Average-MINcBCS for constant c, p(n) > Q(logn/n) and f(n) < O(n/logn). For 
such instances, a randomly chosen graph will thus be a YES-instance with probability at 
most 1/2. However, the naive algorithm for MINcBCS consequently achieves success at 
least 1/2 in time proportional to (;) for k = a'n and O(n/logn), respectively. In both 
cases the time taken by the naive algorithm is much larger than quasipolynomial in n, so 
we might conjecture that these instances have no quasipolynomial time algorithms with 
success at least 1/2, although we offer no further evidence towards the plausibility of this 
conjecture. In fact, certian instances with c < 1 admit polynomial time algorithms with 
success at least 1/2, using algorithms for computing matchings of maximum cardinality 
in bipartite graphs. The following is a result of the well-known algorithm of Hopcroft 


and Karp [69]; the reader is referred to [101] for a survey of more recent advancements. 


Theorem 2.18 ( [69]) There exists algorithms which, given a bipartite graph G € G}, 


compute a maximum-cardinality matching in G in time poly(mn). 
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The distribution Bn p(n) is likely to contain a perfect matching for for p = Q(log n/n), 


as shown by the following result of Erdós and Rényi [58,59]: 


Theorem 2.19 Let p(n) = mate for constant c. Then for G ~ Bryn), 


lim Pr[G contains a perfect matching | = e”. 


Lemma 2.20 For any constants c € (0,1), d and function f where f(n) <n for alln, 
the problem Average MINcBCS with parameters p(n) = motd and f(n) has a polynomial- 
time algorithm with success s(n) satisfying 


lim s(n) = 1 — e” À. 


Proof of Theorem 2.20 Fix constants c,d and function f as in the lemma and let 


pln) = motd, Define the following algorithm: on input G € G”, determine in polyno- 
mial time if G contains a perfect matching. Answer YES is G does not contain such a 
matching and NO otherwise. Then if G is a YES instance, by Hall ’s Theorem (Theorem 
1.1), G does not contain a matching of cardinality greater than f(n) and the algorithm 


answers YES. The result follows by observing that by Theorem 2.19, NO-instances con- 


2e7 


tain a perfect matching with probability e~?° © as n tends to infinity. 


Moreover, Blum et. al. [25] show that for unbalanced graphs G € G? for n > m, one 


may efficiently check if every S C L(G), |S| = m has a matching: 


Definition 2.36 (Direct Concentrator) Define the DIRECT CONCENTRATOR prob- 
lem as follows: given G € G} forn > m, determine if every S C L(G), |S| = m has a 


matching into R(G). 
Theorem 2.21 (Theorem 2 [25]) The problem DIRECT CONCENTRATOR is in P. 


Thus we may obtain a result similar to Lemma 2.20 if the underlying distribution is 
likely to be an (m, 1)-expander. This property is, at the very least, not immediate for 


our example instances by Propositions 2.17 and 2.16. 
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2.4 Open problems 


In our brief survey of results for problems concerning expansion or similar properties, 
we find that problems of bipartite expansion are largely unresolved in comparison to 
the (general) graph problems such as computing vertex separators, sparsest cuts or tree 
decompositions. In particular, apart from the limited NP and W[1] hardness results of 
section 2.3, almost all problems we have defined of recognizing bipartite expander graphs 
are open. The exact parameterized complexity of —BCS and BMCS are open, which 
we conjecture to be hard for at least W[1]. There also may be other parameterizations 
of interest. For example, one may consider the treewidth of a graph as a natural pa- 
rameter; fixed-parameter tractability results for such structural parameters may suggest 
approaches towards efficient algorithms. 

The particular open problems we identify as being of interest are those related to 
pseudo-approximation of (bipartite) expansion, and to the average case hardness of bi- 
partite expansion as formalized in section 2.3.2. Our later results in chapter 3 depend 
on strong assumptions that is hard to distinguish graphs which are either “very poor” 
or “good” expanders. To this end we present two seemingly promising approaches to 
proving hardness for bipartite expansion. The first is by reduction from the MAX3AND 


problem of Feige [91]: 


Definition 2.37 (3AND formula) For d > 0 define a dAND formula as a set of dis- 
junctions of d literals each. We say that dAND formula f is randomly chosen from 


Fiy to mean that f is chosen by selecting g from Ft „ and replacing each clause by a 


m,n 


disjunction over the same literals. 


Theorem 2.22 (Corollary 1 [91]) For every e > 0 and sufficiently large A, the fol- 
lowing problem MAX3AND is RSAT -hard: given a random ¿SAND formula f from aos 


1. output 1 with probability 1 if f is at least (1/4 — e) -satisfiable, and 
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2. output 0 with probability at least 1/2. 


We conjecture that Theorem 2.22 may be used to derive R3SAT-hardness for bipartite 
expansion, by the intuition that a randomly chosen graph from Binon is most likely to 
be expanding, e.g. as formalized by Proposition 2.17, which shows that for any constant 
A, a random graph from Baal is expanding by at least 1 for all sets up to some constant 
fraction of n. Therefore if, for example, one could show that for sufficiently large A, the 
underlying graph for an exceptional (i.e. at least 1/4-satisfiable) 3AND formula cannot 
be a good expander in the sense of Proposition 2.17, then FH implies hardness results for 
the pseudo-approximation of MINcBCS, or Average Regular-MINcBCS for constant c. 
We are unable to show such a result by reduction from MAX3AND using known results 
along the lines of Proposition 2.17. More specifically, consider the graph G chosen from 
Bids corresponding to a randomly chosen 3AND formula f. By Proposition 2.17, G 
is (an, 1 + €) expanding for any constants e < 1/2 and a = «(3,e)A~?/'-9, However, 
in the exceptional case where f is at least 1/4-satisfiable, we are only able to show that 
there exists a Ín, n-contracting set (namely, the vertices of R(G) corresponding to a 1/4- 
satisfying assignment contain I (S) for the set S of satisfied constraints). The exceptional 
case is not disjoint from the typical case where G is an (an, (1 + €))-expander. 

Another promising avenue towards showing hardness of (pseudo-)approximating ex- 
pansion is the so-called quasi-random PCP verifier due to Khot [79]. A PCP verifier 
for SAT is a probabilistic algorithm A which takes an instance @ of SAT along with a 
purported proof m (e.g. a certificate that $ € SAT). Using O(logn) random bits and a 
constant number of queries of bits of 7, A performs a non-adapative test on the queried 


bits and either accepts or rejects; furthermore, A must satisfy the following properties: 


1. Completeness: if o € SAT then there exists 7 such that A accepts with probability 


1, and 


2. Soundness: if p £ SAT then for all 7, A accepts with probability at most 1/10. 
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Khot shows in [79] that there exists a PCP verifier for SAT making a constant d number 
of queries which is quasi-random in the following sense: for any test, the queried bits 
are distributed uniformly over all bits of the proof. In case $ € SAT and T is a valid 
proof, then there is a set of bits my C Tr, |mo] = |7r]/2 where at least (1 — o(1))21=4 
tests access all queries from 7. If $ ¢ SAT and ~ is a purported proof, then for any 
mW Cr, jr] = |r|/2, then a (1 + 0(1))2~¢ fraction of tests access all queries from To. 
More intuitively speaking, for 4 € SAT, the bits examined by the verifier are likely to 
lie in a particular range, wheras if p ¢ SAT, the bits queried are “mostly” random. 
This result can be seen as related to FH by the following combinatorial interpretation: 
consider a set of N bits of a proof on the right, and M possible tests (each randomly 
querying d bits) on the left. Then a choice of random bits for the verifier corresponds to 
choosing a random d-regular graph from M to N. If ¢ € SAT then about 1/29! tests 
have all neighbors (queries) contained in some set 7 of size N/2; otherwise, each set 7” 
of size N/2 entirely contains the queries of about 1/27 tests. Khot obtains similar, and in 
some cases stronger, inapproximability results for the same problems addressed by Feige 
in [91]. However, instead of FH, the results of Khot use the weaker hardness assumption 
that SAT does not have probabilistic 2”"-time algorithms for arbitrarily small constant 


e>0. 


Chapter 3 


Automatizability of Resolution and 


Expansion 


In this chapter we prove our main results concerning the problem of approximating 
minimum refutation size for the resolution system. That is, we establish results showing 
that the so-called automatizability of resolution implies pseudo-approximation algorithms 
for the MINIMUM BIPARTITE c-CONTRACTING SET problem of section 2.3.2. We first 
give necessary definitions and survey existing results concerning the approximation of 


minimum resolution size. We then proceed to prove our main results. 


3.1 Preliminaries 


3.1.1 Automatizability and proof-size approximation 


One motivation for the study of resolution is its relation to common approaches to solving 
the satisfiability problem. In particular, lower bounds on the refutation length for classes 
of formulas imply lower bounds on the worst-case running time of SAT algorithms which 
effectively produce resolution refutations for unsatisfiable instances, such as the DPLL 


algorithm [50]. Moreover, lower bounds for the refutation size of random formulas (cf. 


49 
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[92]) preclude such algorithms from being efficient on average. In light of these negative 
results, the following question is still open: is there an algorithm which efficiently finds 
a refutation, provided that a short refutation exists? This problem is formalized by the 


notion of automatizability, introduced in [31], which we define below for resolution: 


Definition 3.1 (Automatizability)) Say that an algorithm A automatizes resolution 
if, given a contradictory CNF formula T, where |r| =n and Sp(t F 0) = m, A produces 


a refutation of T of size poly(m) in time poly(n + m). 


We say that resolution is automatizable if there exists a (determininstic) algorithm 
which automatizes it. Similarly, resolution is quasi-automatizable if there exists an algo- 
rithm which produces a refutation of size 2P°¥!°8(™ taking time at most 2Polvlos(n+m)_ 
Automatizability is intimately related to the optimization problem of finding minimal 


length refutations, where algorithms are allowed running time polynomial in the size of 


an optimal refutation. We may formalize this as the following decision problem: 


Definition 3.2 (Minimum Refutation Size) Define the decision problem (UNARY) 
MINIMUM REFUTATION SIZE (abbreviated MRS) as follows: given unsatisfiable formula 
T and integer 1*, decide if Sp(t + 0) < k. Say that algorithm A f(n)-approximates MRS 


if on input (7, 1"), letting S = Sr(T E 0), 
1. if S < k then A outputs 1, and 
2. if S > f(k) then A outputs 0. 


This problem is also known in [15], for the case where f(-) is a polynomial, as the 
problem of separating the canonical NP/CoNP-pair for resolution for the polynomial f. 


In [15] the following is proven: 


Theorem 3.1 (Theorem 2 [15]) There exists a polynomial q and a polytime algorithm 
which q(n)-approrimates MRS, if and only if there is a proof system which is both p- 


equivalent to resolution and automatizable. 
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The proof of Theorem 3.1 in [15] is easily generalized to obtain the following analogous 


result for quasi-automatizability. 


Theorem 3.2 There exists a quasipolynomial q and a quasipolynomial-time algorithm 
which q(n)-approrimates MRS, if and only if there is a proof system which is both p- 


equivalent to resolution and quasi-automatizable. 


The familiar reader may note that our definition of automatizability differs from that 
of [31] and [5,6]. The original definition of [31] requires only that the running time (and 
hence proof produced) be polynomial in Sg(T F 0). As noted in [5], this is problematic as, 
for example, in the case of resolution we do not have a priori that |r| < Sr(T F 0). Thus 
in [5] it is proposed that an automatizing algorithm for resolution be allowed running time 
polynomial in |7|+ Sr(7 E 0), and hence to produce refutations of size polynomial in this 
quantity. We propose that, while an automatizing algorithm should be allowed running 


time polynomial in |r| + Sr(T F 0), it is reasonable to restrict the refutations produced 


to be polynomial in only Sr(7 F 0); this definition then accounts for the concerns of [5], 
yet preserves the intuitive relationships between automatization and approximation of 
minimum refutation size, without significant complication in comparsion to the definition 
of [5,6]. We remark that all known results cited in this and following sections still hold 
under our definition of automatizability. 

The notion of automatizability may be generalized by allowing the automatizing algo- 
rithm to produce a P-proof in time polynomial in the shortest R proofs, for two systems 


P and R. 


Definition 3.3 (Weak automatizability) Say that an algorithm A weakly automa- 
tizes resolution with respect to refutation system P if, given an unsatisfiable formula T, 
where |r| = n and S is the size of a minimal P-proof of T, A produces a resolution 


refutation of T of size poly(S) in time poly(n + S). 
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We simply say an algorithm weakly automatizes resolution if it weakly automatizes 
resolution with respect to some system P. It is left implicit that system P should be 
weaker than resolution with respect to p-simulation. We also define weak automatizability 


and weak quasi-automatizability analogously as for automatizability. 


3.1.2 Known results 


We now survey known results concerning the automatizability of resolution. Initial results 
of Iwama [70] show that MRS is NP-hard to compute exactly, or even approximately with 


bounded absolute error: 


Theorem 3.3 (Theorem 1 [70]) I/P 4 NP then there is no polytime algorithm which, 
given an unsatisfiable formula T of size n, finds a resolution refutation of T of size at 


most Sr(T F 0) + poly(n). 


Theorem 3.4 (Theorem 2 [70]) The problem of deciding if, given formula T of size n 


and integer k < poly(n), whether Sy[T F 0) < k, is NP-complete. 


Further progress for the hardness of approximating MRS is made in [2], in which a 
superconstant inapproximability result is proven modulo P 4 NP. We present a weaker 


form of their main result: 


Theorem 3.5 (Corollary 6 [2]) If P 4 NP, then there is no polytime algorithm which 


28D approrimates MRS. 


It is also proven that MRS is hard to approximate to any polynomial ratio if the 


parameter k is represented in binary: 


Theorem 3.6 (Theorem 24 [2]) Jf NP Ç P/poly, then for any polynomial p(n), there 
is no polytime p(n)-approximate algorithm for MRS with parameter k represented in 


binary. 
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Subsequently, Alekhnovich and Razborov [5] showed that both resolution and tree-like 
resolution are non-automatizable modulo parameterized complexity hardness assump- 
tions. Let FPR denote the randomized version of FPT, where algorithms are permitted 


to have one-sided error. 


Theorem 3.7 (Theorem 4.3 [5]) If resolution or tree-like resolution is automatizable, 


then W[P] C CoFPR. 


In fact the reduction of [5] also shows that the automatizability of (tree-like) resolution 
implies algorithms which approximate the W|P|-complete problem Minimum Monotone 
Circuit Satisfying Assignment to within any constant factor. At the time of writing, 
Theorem 3.7 is the strongest known hardness result for automatizability of (tree-like) 
resolution. 

For positive results, the Width Based Theorem Prover (WBTP for short) presented 
in [24] implies that tree-like resolution is quasi-automatizable. Based on fixed-parameter 
tractable algorithms for branch width (cf. [11,100]), Alekhnovich and Razborov present 
the Branch Width Based Theorem Prover (BWBTP) [6] taking time poly(|7|) - 220, 
where bw(T) denotes the branch width of formula 7. The branch width of a formula 
is essentially the tree width of the hypergraph over lit(7) with edges corresponding to 
clauses; see section 2.2.3. In particular, it is shown that wa(7 F 0) < O(bw(7)), and that 
BWBTP automatizes resolution for particular classes of formulas for which the minimum 
refutation length is tightly related to the branch width. 

Open problems include proving unconditional non-automatizability results for (tree 


like) resolution, and conditional non-quasi-automatizability for resolution. 


3.1.3 Size-width tradeoffs for resolution 


Most resolution lower bound techniques follow a certain form, as illustrated, for example, 


in [21]. That is, one considers how truth assignments “flow” through a refutation, i.e. 
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which lines of the proof are satisfied by particular sets of assignments. Each application 
of the resolution rule can be seen as making progress, in ruling out assignments, towards 
the final unsatisfiable clause 0. To prove lower bounds on size, it is first shown that 
any refutation must contain a complex clause which rules out a moderate fraction of all 
assignments, and that such a clause must have high width. A separate argument then 
shows that, given a purported small proof, one may restrict some variables to obtain a 
subrefutation in which no clause has high width and thus obtaining a contradiction. 
The above argument was refined even further by Ben-Sasson and Wigderson [24], who 
essentially reduce the problem of proving size lower bounds to that of obtaining lower 
bounds on width alone. In particular, they prove the following size-width tradeoffs for 


tree-like and general resolution: 


Theorem 3.8 Let 7 be an unsatisfiable CNF formula over n variables. Then 


Sr(7 F 0) > exp(wr(7 F 0) — w(7)) 


Sr(T E 0) > exp(Q([wa(r E 0) — w(7)1?/n)) 


Thus the problem of showing exponential lower bounds on refutation size amounts 
to showing that the minimal width wp(7 F 0) is large with respect to the initial axiom 
width and number of variables. A general approach is proposed (see also [22]) for which 
lower bounds on width follow from a certain expansion property for sets of constraints 
(i.e. CNF formulas), which we now briefly describe. For an assignment p and variable 
x € supp(p), define assignment p|x] to be exactly p but with p[x|(x) = 1 — p(x); we say 


that plx] is the result of toggling p on x. 


Definition 3.4 (Formula boundary) Let A be a set of CNF formulas. Define the 
boundary of A, denoted by OA, as the set of variables x such that there is a unique 


A’ € A and an assignment p on var(A) such that p falsifies A’ but plx] satisfies A. 
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Then given a unsatisfiable set of constraints A and a refutation of A (intepreted as a 
CNF formula), each clause C in the refutation may be associated with one or more sets 


of contraints A’ C A as follows: 


Definition 3.5 (Minimal implication) Say that A = C if any (possibly partial) as- 
signment satisfying all constraints of A also satsfies C. Given a set of constraints A, say 
that A' C A minimally implies clause C if A’ FC and for any B C A, if |B| < |A'| then 
BEC. 


Consider a clause C and a set A of constraints which minimally implies C. Then for 
each variable x € OA, one may take an assignment p which satisfies some A’ € A but 
falsifies C and A. But then one may toggle p on x so that p|x] satisfies A. Then C must 
contain the variable x, as otherwise C would still be falsified by plx]. 

Following this general approach, it then suffices to interpret an unsatisfiable CNF 7 
as a set of constraints A for which all moderately-size A’ C A have large boundary. For 
formulas based on an underlying graph structure (e.g. the so-called Tseitin formulas 
[104]), the above expansion property for constraints generally follows from expansion of 
the underlying graph (cf. [24]). 

Finally, we note that the size-width tradeoffs of Theorem 3.8 are known to be optimal, 
in the sense that explicit families of formulas have been demonstrated to have short 
refutations but relatively large width. For example, it is proven in [33] that a certain form 
of linear ordering principle MGT, over O(n?) variables satisfies wa(MGT;, F 0) > Q(n), 


but Sr(MGT,,) < poly(n), and hence the tradeoff of Theorem 3.8 is nearly optimal. 


3.1.4 Graph pigeonhole principle 


Recall that the pigeonhole prinicple states that for any sets A, B with |A| = |B|+1, there 
is no injective mapping f : A — B. The generalized form where the requirement |A| = 


|B| + 1 is relaxed to |A| > |B| is known as the weak pigeonhole prinicple. The negation 
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of the (weak) pigeonhole principle for sets of size m,n, m > n, is commonly expressed as 


the following contradictory CNF formula PHP” over variables {x;; : i € [m], j € [n]}: 


n] i1<iz<m 

Each variable x;; is interpreted to mean that pigeon 7 is mapped to hole j. Thus 
PHP" states that each pigeon 7 is placed in a hole, but no two pigeons occupy the same 
hole. It is well-known that PH P”_, requires refutations of size exponential in n (cf. [21]), 
and more recently it has been shown that the weak pigeonhole principle PH PJ” requires 
refutations of size nearly exponential in n [95]. In fact it is shown in [22,24] that most 
known lower bounds on the size of refutations of PH Pf” follow from lower bounds for 


the graph pigeonhole principle, defined as follows: 


Definition 3.6 (Graph pigeonhole principle) Fiz G € G'". Define the CNF for- 
mula PH P(G) as consisting of the following axioms: 
Po = y Ti for each i € L(G) 
jEr (i) 


HF =def A (Se Vis) for each j € R(G) 


j 
{41,t2} ET (5) 

Clearly PHP(G) is contradictory if and only if G € GP? for m > n. Observe that 

PHP.” is equivalent to the special case PHP(Km,n). Moreover, given a refutation of 

PHP(G), it is easily shown that by setting some variables to 0 we may obtain a refutation 


of PHP(G”) for a subgraph G’, hence we have the following: 
Proposition 3.9 For any m >n and G € GI”, SR(PHP(G) + 0) < Sr(PHP™ } 0). 


The minimum refutation size of PH P(G) is highly dependent on the expansion prop- 
erties of the underlying graph G. More specifically, PHP(G) requires large width to 
refute when both r¿(1) is large, and each moderately large set of pigeons (i.e. nearly 


ra(1)/2 many) are adjacent to many holes. Formally, we have the following: 
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Lemma 3.10 Fiz G € Gr” where m >n and let k =r¿(1). Then SR(PHP(G) + 0) < 
Sr(PHPE E 0) for some k' < k. 


Proof of Lemma 3.10 Fix G as in the lemma and set k = rġg(1); note that k > 0. 
Let S C L(G) be a set of k pigeons where |T (S)| < k and consider the subgraph G” on 
vertices S, T(S). Clearly PHP(G) contains the unsatisfiable subformula PH P(G’), so 
by Proposition 3.9 we have that SR(PHP(G) F 0) < SR(PH PE) for k' = |T'(S)|. 


We next give a lower bound on wa(PHP(G) F 0) in terms of the expansion of G. 


We apply the general argument described in section 3.1.3. 


Definition 3.7 (Pigeon complexity measure) Fir G € G” form > n. For each 


i € L(G) define the pigeon constraint AF to consist of pigeon axiom PE along with all 


2 


hole clauses C € HY such that raj € C. For I C L(G) let AF =def {AF : i € I}, and 
let AC denote the collection of all AF fori € L(G). Define the complexity measure [ug 


over clauses from var(PHP(G)) as follows: 
ua(C) =der min{|A] : AC AS, AFC) 
Proposition 3.11 For any G € Gi”, m > n, each of the following hold: 
1. for each clause C € PHP(G), pa[C) <1, 
2. pg(0) > roll), 


3. for any clauses D,, Də, C over var(PHP(G)) such that C is a resolvent of Dı and 


D2, La (C) < pa(Di) + ua (D2). 


Proof of Proposition 3.11 Fix G as in the proposition. Property 1 is immediate 
from definitions. To show property 3, fix clauses D1, D2,C such that C is inferred by 


resolving Dı and Də. Let P) and P} be minimal sets of pigeons such that Ag = Dı 
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and AG = Də. Then clearly P, U P, = C and |P, U Pa] < |Pi| + |Pa]. To show 
property 2, observe that for all P C L(G), |P| < rúa(1), we have that |[(P)| > |P]. 


Then by Hall’s theorem, each such P has a matching into its neighborhood, hence AS 


is satisfiable. Thus AG [4 0 for any |P| < rh(1). Proposition 3.11 is completely proven. 


Lemma 3.12 Fir G € GY form > n and parameter r < ra(1) —- 1. If G is an 


([r/2,r],c)-boundary expander, then wrR(PHP(G) + 0) > rc/2. 


Proof of Lemma 3.12 Fix G andr as in the lemma. Let 7 be a refutation of PHP(G). 
By Proposition 3.11, we have that ua(0) > r&(1) >r, ua(C) < 1 for any C € PHP(G), 
and ug is subadditive with respect to applications of the resolution rule. We conclude 
that m contains a clause C with r/2 < pa(C) < r. Let c > 0 be artbitrary and assume 
that G is a ([r/2,r],c)-boundary expander. It suffices to that show w(C) > rc/2. Fix 


P C L(G) such that AG = C and r/2 < |P| < r. By the expansion of G we have 


that |OP| > c|P| > rc/2. We claim that C contains some literal xj; for each j € OP. 
Suppose to the contrary that for some 7 € OP, hole j does not appear in any literal of 
C. Let i’ € P be the unique pigeon of P adjacent to j. Then by minimality of AZ, we 
have that there exists an assignment p such that Aru To = 1 but CÌ, 4 1 and hence 
ASe # 1. Since j ¢ T(P \ {i’}), we may assume w.l.o.g. that z; ¢ supp(p) for all 
i € T(j). Take p’ to be as p, but with p'(x,;) = 1 and p'(x;) = 0 for all i 4 t. By 


assumption that hole j is not mentioned in C, we have that Cl, = Cl, 41. But p 


satisfies constraint AG, hence AG}, = 1, a contradiction. Thus w(C) > rc/2 as desired. 


Thus we have established in Lemma 3.10 and Lemma 3.12, along with the size-width 
relations of Theorem 3.8, a correspondence with the expansion of G and the size of 
resolution refutations of PH P(G). In one direction, we have that if r¿(1) is small, then 


PHP(G) contains a small unsatisfiable subformula. On the other hand, if r¿¿(1) is large 
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then PH P(G) contains no small unsatisfiable subformula, and the width wa(PHP(G) F 
0) is proportional to the boundary expansion of G. Our main results in the remainder 
of this section formalize the above argument to reduce the pseudo-approximation of 


bipartite expansion to the approximation of Sp(PHP(G)). 


3.2 Basic reduction 


In this section we present several results demonstrating our basic reduction from the 
pseudo-approximation of bipartite expansion, as described in section 2.3.2, to the ap- 
proximation of minimum refutation size. The results of this section appeal only to known 
upper and lower bounds on the size of refutations of PH P(G), with the lower bounds all 


obtained via the width lower bound of section 3.1.4 and the size-width tradeoff. 


3.2.1 Resolution size bounds for PHP(G) 


In what follows, we require the following well-known upper bound on the size of refuta- 


tions of PHP"_, from [46]. 


Theorem 3.13 For any n > 0, there is a regular resolution refutation of PHP"*! of 


size O(n?) . 272. 
The following is then immediate from the above Theorem 3.13 and Lemma 3.10: 
Lemma 3.14 For anyn>0 and G € G",, Sr(PHP(G) E 0) < 20060), 


We also use the following lower bound obtained via the size-width tradeoff. In the 
interest of obtaining a simple bound, we assume that the expansion coefficient is propor- 


tional to the degree. 


Lemma 3.15 Let r be a parameter. For any constant 9 > 1/2 and any G € G, if G 


is both an (r, 1)-expander and an ([r/2,r],dd(G))-expander, then 


Sr(PHP(G)) > 2UQEL) 
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Proof of Lemma 3.15 Fix arbitrary parameter r, constant ô and G € G”_, as in the 
lemma. Let d =4ef d(G). Assume that G is an (r, 1)-expander as well as an ([r/2, r], 9d)- 
expander. Then by Proposition 2.1 G is a ([r/2,r], 9'd)-boundary expander for 6’ =¢e 
(26 —1) > 0. Then by Lemma 3.12 we have that wa(PHP(G) F 0) > 0'd -r/2. Observe 
that w(PHP(G)) = d and |var(PHP(G))| < dn. By Theorem 3.8, we have that there 


exists constant h > 0 such that for sufficiently large n, 


Sr(PHP(G)+ 0) > exp(h- (wa(PHP(G) E 0) — d)?/(dn)) 


= exp(h- d(ef(n)/2 — 1)*/n) 


Clearly ef (n)/2 — 1 is Q(f(n)), and the desired result follows. 


3.2.2 Main results 


In what follows we will use the bounds of lemmas 3.10 and 3.15 to obtain reductions from 
pseudo-approximation problems for bipartite expansion of a particular form. The inputs 
will be graphs from G_,; for simplicity we initially consider only d-regular graphs for 
arbitrarily large constant d. The YES-instances are graphs G for which r¢(1) < b, in 
which case G effectively contains PHP? | as a subformula. The NO-instances are graphs 
G for which r¿(1) >b for some bound b >> b, and which are good boundary expanders 
for sets of size proportional to b’. Thus if b’ is large enough, NO-instances require large 
width and hence large refutations by Lemma 3.15. We (informally) consider the afore- 
mentioned problem as the problem of pseudo-approximating MRS to a ratio of b'/b. Our 
reductions will obtain a pseudo-approximate algorithm taking time at least polynomial 
in SR(PHP(G) E 0) for YES-instances G, which by Lemma 3.10 is exponential in r¿(1). 

Our results below may be considered, informally, as hardness results for automatiz- 


ability modulo the following conjecture: 


Conjecture A. For any pseudo-approximation problem of the form described 
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above with ratio b/b', there is no algorithm with running time significantly 


lower than min( (), (7)) - O(n). 


Thus we aim to obtain reductions from pseudo-approximation problems with bounds 
b << b' for YES and NO instances respectively, for which 2°) is substantially smaller 
than both (o) and eae Under the assumption that pseudo-approximation is quantita- 
tively harder as b becomes closer to b’, we also wish to minimize the ratio b/b'. We now 


proceed to give our main results. 


Theorem 3.16 Suppose MRS is p(n)-approrimable by a polynomial time algorithm for 
some polynomial p(-). Then for any constants 0 < e < 1/2, 9 > 1/2 and d > 0, there 
exists a constant 0 < h < 1—e and algorithm A such that given d-regular graph G € G}, 


(ne 


A runs in time at most 2°, and 


1. (YES-instance) if rg(1) < n€ then A outputs 1, and 


2. (NO-instance) if ro(1) > nt? and G is an ({net"/2, ne"), $d)-expander, then A 


outputs 0. 


Proof of Theorem 3.16 Assume D is a polytime algorithm which n*“-approximates 
MRS in time nt for absolute constants c,t. Fix constants d,e and 6 as in the theorem. 
Set constant h so that 1 — € > h > 1 — 2e. Let e, denote the hidden constant of Lemma 
3.14. Define algorithm A as follows: fix as input a d-regular graph G € G?_,. Form 
r= PHP(G) and N = 2%” in time poly(n) + poly(N) and form instance y = (7, 1%). 
Run algorithm D on instance y, taking time (|7| + NV), and output whatever D outputs 
as the answer of A. Since |r| is O(n?), clearly A satisfies the desired running time bound. 
It suffices to show the output of A is correct. Suppose first that r¿(1) < nt. By Lemma 
3.14 we have that Sa(T E 0) < 29760 for n sufficiently large, in which case D outputs 1. 


Suppose then that G is both an (n°, 1)-expander and an ([n*+*/2,n***], 6d)-expander. 
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Then by Lemma 3.15 we have that Sp(r H 0) > 20(1n22%2 for some constant co and 
sufficiently large n. Then for sufficiently large n we have that 


do ne+2h-1 


SRT H 0)/N* = 241 


By our choice of h, the above quantity is 22%) for some e” > 0, hence Salr F 0) is 


greater than N°. Therefore D outputs 0 as desired. 


We then obtain the following simplified corollary of Theorem 3.16, which follows 


immediately from Theorem 3.1 and Proposition 2.1: 


Corollary 3.17 Suppose resolution is automatizable. Then for any constants 0 < e < 
1/2, 6 > 1/2 and sufficiently large d > 2, there exists a constant 0 < h < 1—€ and 


algorithm A such that given d-regular G € G”_,, A runs in time 2009, and 
1. ifra(l) < n€ then A outputs 1, and 


2. ifralód) > ne" then A outputs 0. 


We note that Theorem 3.16 and the subsequent Corollary 3.17 hold even if the set 


size parameters in the YES and NO instances are an‘ and agn**” 


respectively, for any 
constants a1,a2 E (0,1). With respect to Conjecture A, observe that Theorem 3.16 
obtains a ratio of at most ni“ for e < 1/2. Furthermore, using a well-known bound on 


binomial coefficients, we find that for any e € (0, 1), 


n > ay — 92(n* logn) 
ney \ne 


which is bounded above the running time of 2°) for the pseudo-approximate algorithm 
of Theorem 3.16. By the same reduction and proof technique of Theorem 3.16, we may 
obtain a similar result, but for a pseudo-approximate algorithm running in polynomial 


time. 
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Theorem 3.18 Suppose that MRS is p(n)-approximable in polynomial time for polyno- 
mial p(-). Then for any constants e > 0, 6 > 1/2 and sufficiently large integer d > 2, 


there exists constant e” > 0 and polytime algorithm A such that given d-regular G € G}, 


1. ifra(1) < elogn then A outputs 1, and 


2. if G is an (e ynlogn, ód)-expander then A outputs 0. 


Proof of Theorem 3.18 Let D be a polytime, p(n)-approximate algorithm for MRS. 
Let t,c be constants such that p(n) < n° and D runs in time at most nt. Fix constants 
e,9 and d as in the theorem. Let € be a constant with value to be determined later. Let 
€¡ , €2 denote the hidden constant of lemmas 3.10, 3.15 respectively. Define algorithm A as 
follows: given as input a d-regular graph G € G”_,, form instance y =der (PH P(G), 1%) 
for N =ger n° (clearly this may be done in time polynomial in n). Run D on input 
y, taking time (O(n?) + NY = poly(n). Finally, output whatever D outputs. Clearly A 
runs in time polynomial in n. We now show that A is correct. Suppose first that r¿(1) < 
elog n. Then by Lemma 3.10 we have that for sufficiently large n, SR(PHP(G) F 0) < N, 
hence D outputs 1. Suppose then that G is an (€ynlog n, 6d)-expander and take d large 
enough so that dd > 1. Then by Lemma 3.15, for n sufficiently large we have 

Cte 


= odez-e?-logn 


Sr(PHP(G) E 0) > 242 


Setting e > y/cee1/(de2), we see that SR(PHP(G)) > N° and so D outputs 0. 


With respect to Conjecture A, we find that the pseudo-approximate algorithm of 
Theorem 3.18 acheives a ratio of Oy/n/log n in polynomial time. Again using a lower 


bound on binomial coefficients, we see that 


n n clogn 2 loglog n 
> a ge log n(l- Zioen), 
elogn) — Lelogn 


Since (loglogn/logn) vanishes as n increases, this bound is superpolynomial for large 


enough n. 
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Even in light of our strong informal assumption (Conjecture A), the results of 3.16 
and 3.18 should be considered very weak as evidence for the hardness of automatizing res- 
olution; in both cases, the running time of the pseudo-approximate algorithms obtained 
by reduction are not strongly bounded below the running time of the naive algorithm. 
In particular, a more sophisticated reduction, or stronger upper and lower bounds for 
PHP(G), are necessary to prove hardness of quasi-automatizability modulo Conjecture 


A. Consider the following: 


Proposition 3.19 Suppose that resolution is quasi-automatizable. Then for any con- 
stants Cy > 0,€9 > 0,6 > 1/2 and sufficiently large d, there exists constants €,,c, > 0 


and quasipolynomial-time algorithm A such that given d-regular graph G € G}, 
1. ifra(1) < eg log” n then A outputs 1, and 


2. if G is an (ey/nlog” n, 0d)-expander then A outputs 0. 


Proposition 3.19 may be proven in the same manner as Theorem 3.18; we give a 
proof in Appendix B to avoid distraction. Using a well-known upper bound on binomial 


coefficients, we have that 


n c n elog"n 
A z ef leg n, ( - ) x nPolylogn 
celog n celog n 


and therefore Proposition 3.19 is trivial as a hardness result for quasi-automatizability. 


3.3 Simulation of Res(k) to increase the refutation 
size gap 


In this section we show that resolution is not (quasi-) automatizable unless there exists 
algorithms which violate certain conjectures that identifying bipartite expansion is hard 


on average (see Section 2.3.3). The particular hardness assumption we will use is stronger 
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than the pseudo-approximation hardness assumptions of Section 3.2. However, to the 
author's knowledge, our main result of this section is the first conditional hardness results 


for the quasi-automatizability of resolution. 


3.3.1 Preliminaries 


We state our main results for this section as well as necessary definitions for the associated 


reduction. 


Definition 3.8 For any integer n > 1 and parameters c(n), p(n), 0 < p(n) < 1, define 
the distribution By p(n)le(n)] over GÍ as follows: independently for each e € L(G) x 


R(G), choose e to be an edge of G with probability p(n). 


Our main result of this section shows that Resolution is not quasi-automatizable 
unless there exists quasipolynomial-time algorithms with success at least 1/2 for a certain 
(contrived) instance of the Average MINcBCS problem of Section 2.3.3 for unbalanced 


bipartite graphs. 


Conjecture 3.19.1 For all constants a,b > 0, there is no quasipolynomial-time algo- 
rithm A which, given a randomly chosen graph G ~ B ogn [n + n/log* n], behaves as 


follows: 
1. ifra(1/2) < n/log* n, then A outputs 1, and 


2. A outputs 0 with probability at least 1/2. 


Theorem 3.20 Suppose that Conjecture 3.19.1 holds. Then there is no quasipolynomial 


time, 2P'vl8(") anproximate algorithm for MRS. 


Theorem 3.21 Suppose that Conjecture 3.19.1 holds. Then Resolution is not quasi- 


automatizable. 
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Our reduction employs a novel technique used in [15] to demonstrate efficiently con- 
structible formulas having small refutations but requiring relative large width, which in [5] 
is identified as necessary for obtaining non-quasi-automatizability results. We proceed 


first with the necessary definitions and results of [15]. 


Definition 3.9 (k-disjunction form) Fix integer 0 < k < 2n and formula f on n 
variables. Define the k-disjunction form of f, denoted f[k], as follows: for each subset 
L € lit(f) of size at most k, define new variable zz, to represent the conjunction of all 
LE L. Define f|k] to be f along with the following formula for each zg, L = {21,..., £s}: 
(azi V... V ae V ZL) A^ (Arz va). 
1<s 
Note that f[k] has width max{w(f),k +1) and size at most |f|+(k+1)(7) < 
|f| + O(n*). The following results then relate the size of minimal refutations of f[k] to 


the size of minimal Res(k) refutations of f. 


Lemma 3.22 (Lemma 1 [15]) For any CNF f and k > 0, if f has a Res(k) refutation 


of size S, then f[k] has a resolution refutation of size O(kS). 


Lemma 3.23 (Lemma 2 [15]) For any CNF f and k > 0, if F[k] has a resolution 
refutation of size S then f has a Res(k) refutation of size O(kS). 


We also require the following well-known result from [88] which proves the existence 


of quasipolynomial-size Res(k) refutations of PH P?” for k = polylog(n). 


Theorem 3.24 (Theorem 11 [88]) There exists constants c,d > 0 such that for n 


sufficiently large, PHP?" has Res(log° n) refutations of size glogtn 


The proof of Theorem 3.20 exploits lemmas 3.22 and 3.23, for k = polylog(n), by 
forming the formula PHP(G)[k]. For YES-instances, PH P(G)[k] will have a small 


refutation of size roughly quasipolynomial in n. We then show that for NO-instances, 
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i.e. random graphs, with probability at least 1/2 for randomly chosen G, S;,(PHP(G)) 
is at least 218°” for arbitrarily large constant a (in particular, a can be chosen arbitrarily 
large provided |L(G)| is small enough compared to |R(G)|). 

The most involved portion of our subsequent proof of Theorem 3.20 is showing a suf- 
ficient lower bound on Sr(PHP(G)[k]) which holds with high probability for randomly 
chosen G. Observe that, while a randomly chosen graph will likely be a very good ex- 
pander (as demonstrated by Proposition 2.16), the size-width tradeoff of Theorem 3.8 
is insufficient to prove strong resolution lower bounds for PH P(G)|k], since the trans- 
formation to PH P(G)[k] increases the number of variables to roughly quasipolynomial 
in n. There are several known strong lower bounds on the size of Res(k) refutations of 
the (weak) pigeonhole principle. Particularily, in [102] it is proven that with non-zero 
probability, for constant p € (0,1) and G ~ Bnp, PH P(G) requires Res(k)-refutations 
of size 2” for some e > 0, provided that k < vlog n/loglog n. This result is improved 
upon in [96] for k < O(logn/loglogn). Both results are insufficient for our purpose as 
we require k > polylog(n); moreover, we are unable to adapt the proof technique to even 
give a weaker lower bound for large enough k. 

On the other hand, it is well-known that Res(polylog(n)) is an instance of a so-called 
bounded-depth Frege system. There are several superpolynomial bounds for bounded- 
depth Frege refutations of PH P"”_,, for example [84,94]. However, these results are 
insufficient for our purposes, as we require a lower bound for PHP(G) which holds 
with high probability for randomly chosen G. We will adapt a result of [37] in which 


quasipolynomial lower bound is shown for PHP?” for m < n + y Specifically, in 


a Ns 
polylog(n 


subsequent sections we prove the following: 


Theorem 3.25 Fix constants c > 0 and h > 4, and let b =4ef 8(he + 3), and d(n) =def 
log’n. Then there exists constant a > 0 such that for sufficiently large n, if G ~ 
Bn dm) nn + n/ log’ nj, then with probability at least 1/2 any Res(log"n) refutation of 
PHP(G) has size at least 2208" ™ 


CHAPTER 3. AUTOMATIZABILITY OF RESOLUTION AND EXPANSION 68 


To motivate the material of following sections, we first describe at a high level the 
proof strategy we use for Theorem 3.25. We assume for contradiction's sake that there 
is a short Res(k) refutation m of PHP(G). We define decision trees which represent 
the lines of such a refutation, in that the boolean function computed by a line and its 
representing tree agree on a large class of restrictions. We then apply a switching lemma 
which asserts that, with high probability over the random choice of G, given that 7 is 
small, we may apply a restriction p such that each 7, line is represented by a decision 
tree of small height. A separate argument shows that the restricted refutation still must 


have a decision tree of large height, yielding the desired contradiction. 


3.3.2 Matching decision trees and monotone Res(k) refutations 


In this section we give basic definitions necessary to prove Theorem 3.25. 


Definition 3.10 (Matching restrictions) Let G be an arbitrary bipartite graph. Say 
that two edges of G are inconsistent if they share exactly one endpoint; say that two sets 
of edges A,B are inconsistent if there exists an inconsistent pair in A x B. A partial 
matching on G is a set p of consistent edges; we denote by P(p) (H(p)) the set of 
j € L(G) (R(G)) appearing in p, and denote supp(p) = P(p) U H(p). Say that p is an 
l-matching if |p| = £. 

Each l-matching p on G is uniquely identified with a matching restriction p' over 
var(PHP(G)) as follows: p'(ai;) = 1 for all (i,j) € p, p(xij) = 0 for all (i,j) € E(G) 
inconsistent with p, and p(xj;) = x otherwise. We denote by supp(p) the set of all 


u € V(G) mentioned in p. 


For simplicity, for fixed graph G we will implicitly identify edges of G by variables of 
var(PHP(G)) and vice versa, hence we use matching restrictions and partial matchings 
interchangeably. In what follows, we also assume that any bipartite graph G € G)” 


has an implicit ordering of its vertices, so that DNF formulas over var(PHP(G)) are 
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implicitly ordered lexicographically according to the order on G. That is, we let supp(t) 
for t a term denote the set of v € V(G) appearing in t. Then for a DNF f all terms t are 


ordered according to supp(t). 


Definition 3.11 (Matching disjunction) Let G be a bipartite graph. A matching dis- 
junction for G is a monotone DNF formula f such that for each term t € f, the set 


{(i,j) : viz € f} is a partial matching on G. 


Definition 3.12 (Matching decision trees) For G a bipartite graph, a matching de- 
cision tree for G is a tree T, along with a labelling nr from internal nodes of T to vertices 
of G, a labelling er from edges of T to edges of G, and a labelling lr from leaves of T to 
{0,1} such that 


1. if np(u) =v then er(u) is incident on v, and 


2. for each path in T from the root to node u, the edge-labels of the path comprise a 


partial matching, denoted pathy(u). 


Say that T is a leaf-unlabelled matching decision tree if T satisfies the definition above 
except has no labelling ly. For T a matching decision tree, define T to be T but with 
lpe(u) = 1 — lr(u) for each leaf u. 

Define Br(T) as the set of all pathy(u) for u a leaf of T. Define Bre(T) to be the 
set {p € BrT : r(p) = ej. For any T we define the matching disjunction disj(T) to 
contain all terms corresponding to p € Bri(T); that is, disj(T’) = V serra) Na.jep Tis: 
We denote the height of T by h(T). 


Definition 3.13 (Full matching tree) Let G be a bipartite graph and K C V(G). De- 
fine the full matching tree for K over G, denoted Fal K), as the following leaf-unlabelled 
matching tree, defined inductively: If K is a single vertex u, then Fa(u) consists of root 
labelled by u connected to leaves for each v € Falu), where each edge is labelled with the 


edge (u,v) of G. Otherwise for |K| > 1, let u be the largest node of G in the canonical 
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ordering and let T = Fa(KX {u}); we construct Fa(K) as follows. For each leaf v € T, 
if u £ supp(pathr(v)), then label leaf v with vertex u and add edges from v for each edge 


e € E(G) not incident on supp(pathp(v)). 


Definition 3.14 (Canonical decision tree) Let G be a bipartite graph and f a match- 
ing disjunction for G. The canonical decision tree for f over G, denoted as Ta(f), is 
defined inductively as follows: if f € {0,1} then Calf) is a single leaf labelled by f. 
Otherwise, let t be the first term of f by the canonical ordering. Then Ta(f) consists 


of the full matching tree T = Falsuppít)) with each leaf u of T replaced by the tree 


is (f path(u)) g 


Definition 3.15 (Complete and representing matching trees) Say that (leaf-unlabelled) 
matching tree T is complete if for each internal node u, {path(v) : (u,v) € E(T)} is the 
set of all matchings obtained by adding a single edge to path(u). Say that a decision tree 


T represents a matching disjunction f if for each leaf u of T, flpatn(u) = £r (u). 


Observe that for any G, f the decision tree Ta(f) is always leaf-labelled, complete, 
and represents f. The following results show that matching decision trees are robust for 


matching restrictions: 


Proposition 3.26 (ref) Fix bipartite graph G. For any matching decision tree T, 


matching p, and matching disjunction f over G, 
1. dis? )| ¢= dis7 (23): 
a ae (ThS 


3. IfT represents f, then TÌ, represents ff). 


The essential property we require of canonical matching decision trees is as follows: 


CHAPTER 3. AUTOMATIZABILITY OF RESOLUTION AND EXPANSION 71 


Lemma 3.27 (Lemma 1 [37]) Let G be a bipartite graph and T a complete matching 
tree for G. Let d denote the minimal degree of any u € V(G) mentioned in T. For any 
partial matching p on G, if d > max{|p|,h(T)}, then there exists p € Br(T) that is 


consistent with p. 


The preceding definitions are intended to allow a Res(k) refutation to be analyzed by 
associating with each line a canonical matching decision tree which represents that line. 
In general, the lines of a Res(k) refutation are not matching disjunctions. However, for 
the case of PH P(G) we may transform a Res(k) refutation into a form where each line is 
in fact a matching disjunction, analogous to the so-called monotone or positive calculus 
refutations used to prove resolution lower bounds (cf. [38,97]). Our “monotone” Res(k) 
refutations will operate on only positive literals, by interpreting =x; to mean that some 
pigeon other than i is placed in hole 7. We formally define a monotone refutation system 


for PH P(G) as follows: 


Definition 3.16 (Clash operator) Fix bipartite graph G. Define the clash operator 
Lg over var(PHP(G)) as follows: xij La try iffi A i and j! = j. Similarily, for 
I,l’ Cvar(PHP(G)), I lol ifx Lx foralxel, el”. 

Let f be a k-DNF over var(PHP(G)) for some bipartite graph G. The monotone 
form of f, denoted ft¢, is ak-DNF containing only positive literals, formed by replacing 
each =x; with the maximal clause C such that C La xij, then expanding the resulting 
expression to a DNF. For F a set of DNF formulas we use F*¢ to denote the set {f*¢ : 


f EF}. 


Definition 3.17 (PC(k) refutation) Fix k > 1 and bipartite graph G € G}, m > 
n. A PC(k) refutation of PHP(G) is a sequence t = (fi,..., fc) of DNF formulas 


containing only positive literals, where fe = 0 and for each i < £, either fi = AtS for 
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AE PHP(G), or fi follows from f;,, fio € T, 11 < i2 < i, by one of the following rules: 


z for k-DNFs C and D 


CVz1, CV Ni=2..j Li 
CV Niet] Ti 


for j < k, k-DNF C, 21,...,2; € var(A) 


CiVA, y Li, C2VV ; 21 Gi A 
a > forj < k and k-DNFs C,,C, %1,..., £j E var(A), 


such that gi La x; fori € [j]. 
We refer to the above rules as subsumption (or weakening), AND-introduction and cut, 


respectively. 


Note that for any hole clause C = (22 ;¡ V 72;), CTS is the clause Vier) Zij, Which 
we denote as HE. It is easily verified that applying the monotone transformation to 
each line of a Res(k) refutation yields a PC(k) refutation of at most the same size, so 
PC(k) lower bounds are sufficient to prove Res(k) lower bounds for PH P(G). Most 
importantly, the following properties of Res(k) refutations are preserved by PC(k) under 


matching restrictions: 


Lemma 3.28 Fir k > 1 and letn be a PC(k) refutation of PH P(G) for some bipartite 
graph G. For any f € n, if f is inferred from fı, fo, then for each set of terms tı € 
fi, te E fo such that ty,t2 contain no literals xı, £2 such that xı La x2, then there exists 
some term t € f such that for any matching restriction p, if tif, = 1 and tol, = 1 then 


tl =1. 


Lemma 3.29 Fir k > 1 and letn be a PC(k) refutation of PH P(G) for some bipartite 
graph G. For any matching restriction p over var(PHP(G)), mI, is a PC(k) refutation 
of PH P(G},). 


We prove Lemmas 3.28 and 3.29 in Appendix B to avoid distraction. We next show 


how a monotone refutation may be put into a form where each line is a matching dis- 
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junction, which will permit the application of Lemma 3.31. A proof of the following also 


appears in Appendix B. 


Definition 3.18 (Matching normal form) Let G be a bipartite graph and f a mono- 
tone DNF over var(PHP(G)). The matching normal form of f over G, denoted Malf), 
is the matching disjunction consisting of all terms t € f*S for which all literals of t are 


consistent. 


Clearly for any G and DNF f = AV B, Malf) = Mal A) V Ma(B). Furthermore 


or any clause C containing only positive literals, Ma(C) = C. 


Lemma 3.30 LetG be a bipartite graph and n a Res(k) refutation of PHP(G) for some 
k > 1. Then there is a PC(k) refutation of PHP(G) of size at most 3\r| where every 


line is a matching disjunction. 


3.3.3 The switching lemma 


We present a switching lemma for the purpose of proving lower bounds for Res(k) refu- 
tations, suitable for use with k = polylog(n). The term switching lemma owes to similar 
results used to prove lower bounds on boolean circuits, where it is argued that, upon 
applying a randomly chosen restriction to a DNF formula with small terms, the result- 
ing formula may be represented by a CNF of small width; we refer the reader to [18] 
for a more detailed background. We first define the distribution from which random 


restrictions will be selected. 


Definition 3.19 (Random restriction) For a bipartite graph G € Gi", S C R(G) and 
integer d, define M€(G) as the set of all partial matchings p in G where im(p) = S and 
Gl, has maximum degree d; we simply write Ms(G) to denote MẸ (G). For any r € [n], 


we define M, (G) = Uscria)¡sj=r Ms(G). 
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For r € |n], define the distribution M,(G) by the following procedure for choosing 
a matching p: Uniformly choose S C R(G) of size r. If Ms(G) is empty then p = 0. 


Otherwise, uniformly choose p from Ms(G). 


Note that if G contains a matching over all of R(G) then p ~ M,(G) = @ with 
probability 0. We now state the switching lemma for matching restrictions. The following 
Lemma 3.31 is based on Lemma 8 of [37], but with the distinction that the probabilities 


involved are only over choice of random restrictions. 


Lemma 3.31 (Switching lemma) Fiz n > 0 and constants a,c,s > 0 such that c — 
6(a +3) < 0 and s < Glog*n. Set b =der Ela +3), r =aep N(1 + 1/log”/*n) and 


A Saef 6log’/*n. Let H be a bipartite graph from G"*! satisfying the following: 
1. the maximum degree of H is at most 6 log’ n, 
2. Ms(G) £90 for all S C R(G), |S| =r, and 


3. |M8(H)|/|Ms(H)| > 1- 27092 for at least a (1—3/n) fraction of all S C R(G), 
Poy =r. 


Let f be a matching log" n-disjunction over var(PH P(H)). Let p~ M,(H) and denote 
S =qef holes(p). Then the conditional probability that h(Ti,(f1,)) = s, given that events 


p € MA(H) and|M&(H)|/|Ms(H)| > 1-27 82 occur, is at most 2(72010g 72 n)s/?, 


The proof of Lemma 3.31 is essentially the proof of Lemma 8 of [37] with some 
simplifications, in particular by fixing the underlying graph H. The complete proof 
appears in Appendix B to avoid distraction. We apply the switching lemma with a union 
bound to show that for a small enough set of matching disjunctions 7 over var( PH P(G)), 
with non-zero probability a restriction p ~ M,(G) permits each f € m to be represented 


by a short matching decision tree: 
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Lemma 3.32 Fizn > 0 and constants a,c > 0 such that c — 6(a+ 3) < 0. Set b =der 
8(a +3), r =aep N(1+1/logP/Bn) and A =der 6 log’/*n. Let H be a bipartite graph from 


G"*t! satisfying the following: 
1. the maximum degree of H is at most 6 log’ n, 
2. Ms(G) £90 for all S C R(G), |S| =r, and 


3. |M8(H)|/|Ms(H)| > 1- 27092 for at least a (1—3/n) fraction of all S C R(G), 
eos ee 


Ife < b/2, and r is a set of at most 3-21%8*7 matching log" n-disjunctions over var(PHP(H)), 
then the conditional probability over p ~ M,(H), given that the events p € M(H) and 
|Mities(p)(H)|/|Mnotes(o)(H)| > 1-2-8" occur, that Tm, (flp)) < log*n for all 


fer is at least (1 — 3/n). 


Proof of Lemma 3.32 Fix n,a and c as in the lemma, and let b,r be as defined 
in the lemma, and assume that c < b/2. Let H € GP! be a bipartite graph sat- 
isfying all assumptions of the lemma and 7 a set of at most 3 - 218”? matching dis- 
junctions over var(PHP(H)). Let B denote the event p € M(H) and C the event 


a+l 


|M(H)|/|Ms(H)| > 1-27 18% ” for S =gef holes(p). Let A(f) denote the event that 
h(T,(flp)) 2 108g%n for f € m, and let A denote the event (Aye, A(f). By Lemma 3.31 
we have that Pr[A(f)|B,C] < 2(720log® "/”)!e*"/? for all f € m. Then by the union 
bound, we have 


Pr[A|B, C] < Y PrjA($)|B, C) 


fer 


<3. glog* n i 2(720 log?—/2)los® n/2 


< 3log"n - 2 PEN, 


where the last inequality follows for sufficiently large n since c < b/2. Thus Pr[54|B,C] > 


(1 — 3/n) as desired. 
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3.3.4 Lower bound 


We now present the proof of Theorem 3.25. Ignoring parameters for the moment, in 
what follows we show that for G a randomly chosen graph, with high probability, for any 
PC(k) refutation 7 of size at most S there exists a matching p such that each line of rr], 
has h(TG¡,(f1,)) < s. By previously established properties of PC(k), rr), is a refutation 
of PH P(G],). A separate argument shows that any PC(k) refutation of PHP(G],) 
must have a line f where h(T@,(f)) is at least s, hence no such small PC(k) refutation 
exists. We first formally state the first part of the above formal argument as the following 


lemma: 


Lemma 3.33 Fiz n > 0, constants c,h > 0 where h > 4, and let a = hc. Set k =def 
log"n, b =qep Bla + 3), r =gef n(1 + log 8 n), M =p N+n/ log’*t n and d 4H 
log?n. For each G € Gir let ra be an a arbitrary set of matching k-disjunctions over 
var(PHP(G)) of size at most 3-218*7. Then for n sufficiently large, with high probability 


over the choice of G ~ Bnajn|m], there exists some p € M,(G) such that: 
1. for all f € re, h(Tat,(ftp)) < log* n, and 
2. Gl, has minimum degree at least log! n. 


In the interest of obtaining a simple proof for Lemma 3.33, we first prove some 
technical results concerning the distribution B,, a/n[m]. Note that it is only in the following 
results that we employ probabilistic analysis of random graphs. Unlike previous results of 
this thesis, we do not use (boundary) expansion properties of random graphs, in the sense 
of Proposition 2.16. Thus the following demonstrates part of the power of Conjecture 
3.19.1 in proving non-automatizability results, in the use of lower bound techniques which 
employ the analysis of random graphs. Our first result is a strengthened form of Lemma 


6 of [37]; we present its proof for completeness. 
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Lemma 3.34 Fiz n > 0,b> 3 andm>n-+n/logb+1n and set d =4ef log?n. Then 
for G ~ Ba ajnml, the probability that G contains a matching on all of R(G) is at least 


127108 Pn for sufficiently large n. 


Proof of Lemma 3.34 Fix n,b,d and m as in the lemma and consider G ~ Brayn: By 
Hall’s Theorem, the probability that G does not contain a matching over all of R(G) is 
less than the probability that there exists H C R(G) with at least n —|H| non-neighbors 
among the first n vertices of L(G). By the union bound and definition of B,, 4/n[m], this 


probability is bounded by 
|) (1-d/ny ron = ( i (1 — d/n I + ( ) (1—d/ny 0-9 
J ja J j=n/2+1 puna 
n/2 
NO [P -d/ny 1! + See 21 d/ny]? 
j=1 j=n/2+1 
n/2 
a Sf 2 N ija oa n2 ean" 
j=l j=n/24+1 
n/2 


< ` Deer very, 
j=1 


where we have used the well-known bounds (") < n and (1— x) < e”. Substituting 


IA 


d = log’ n, the desired bound follows. 


Note that Lemma 3.34 implies that with high probability, G ~ By a/n[m] has Ms(G) # 
Ü for all subsets S C R(G). We will require the following technical facts adapted from 


[37]: 


Proposition 3.35 (Lemma 4 [37]) Fiz n > 0, b > 3 and m >n+n/log’t'n, and 


set r —def n(1 + log 7/8 n) and d =def log? n. Let G ~ Barani = r]. Then the 
probability that G has some vertex with degree greater than 6 log?/* n is at most 27 18P n 
Furthermore, the probability that G has some vertex with degree less than log’/® n/2 is at 


— Jogb/8-1 
most 27 108 A 
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Proposition 3.36 (Lemma 5 [37]) Fizn >0,b>3 and m > n+n/log"*!*n and set 


d =def login. Let G ~ Bn alm]. Then the probability that G has some vertex with degree 


greater than 6 log’ n is at most 2718" 'n, 


Lemma 3.37 Fizn >0,b andm>n+ n/log?*? n and set d =def log? n, T =4ef N(1 — 
log7B np) and A =def 6log’/*n. Then for G ~ Bnajn{m|, the conditional probability, 
given that Ms(G) is non-empty for all S C R(G) of size r, that |MÉ(G)|/|Ms(G)| > 


1 — 27108”? n for at least a (1 — 3/n) fraction of all S C R(G), |S| =r, is at least 2/3. 


Proof of Lemma 3.37 Fix n,b,m and d,r,A as in the lemma and consider G ~ 
Bna/n|m]. Let P denote the event that Ms(G) is non-empty for each subset S C R(G) of 
size r. It is easy to verify that, for any matching p € M,(G), the graph GT, is distributed 
by Ba ajnim) for n! = nlog"4 n [37]. Thus by Proposition 3.35, for any S C R(G), 


|S| = r, the conditional expectation of Rs(G) =aer |M$(G)|/|Ms(G)| given P is at 


b/8-1 


least 1 — 278% "7. Then the random variable Xs =1—|M%£(G)|/|Ms(G)| is bounded 


away from 0 so by Markov’s inequality we have that the conditional probability that 


Xs < n25 log’ n < 9-108/8n is at most 1/n. Let Ys be the indicator for the event 


b/8-2 y, 


M#(G)|/|Ms(G)| < 2718 , and let Y =4e _, Ys. Then by linearity of 
S f SCR(G),|S|=r 


expecation, E[Y|P] < wo. Applying Markov’s inequality once more, we have that the 


conditional probability that Y > 3(”) is at most 1/3. 


Proof of Lemma 3.33 Fix c,h, n and a as in the lemma, and let k,b,r,d and m be as 
defined in the lemma. Let A denote the event that there exists a restriction p satisfying 
property 1 of the lemma. Let B(p), for p a matching restriction of size r, denote the 
event that G], contains a vertex of degree at least log*** n. It suffices to show an upper 
bound on Pr[=A] + Pr[=B(o)] for arbitrary o € M,(G). 

We first bound Pr[4]. Let D denote the event that G has maximum degree at most 


6log’n, M the event that Ms(G) is non-empty for all S C R(G) of size r, and R the 
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event that |M$(G)|/|Ms(G)| for at least a (1—3/n) fraction of S C R(G) of size r. Then 
if D, M and R each occur, G satisfies the requirements of Lemma 3.32. By definition 
c < b/2 and |r| < 3 - 218°”, thus given D, M and R, the probability over p ~ M,(G) 
that h(Ta;,(f1,)) is at least log” n for some f € m is bounded away from 1. Thus the 
conditional probability (over choice of G) of A given D, M and R is 1 for n sufficiently 
large. Therefore 


Pr[-4] < Pr[=D] + Pr[5M] + Pr[-R|M] 


b-1 n 


< 27 108 


where the second inequality follows from Proposition 3.36, Lemma 3.34 and Lemma 
3.37 respectively. By definition, llog’ n > log**' n so by Proposition 3.35, Pr[=B(c)| 
is at most 27!8”""'” for any o € M,(G). Thus the probability that, either there is 
no restriction p satisfying property 1, or that an arbitrary p satisfying 1 has G], with 
minimum degree below log**? n, is at most 


1 4 g-log“tn b/8-1 p b/8-2 y, 
3 


+ 97 log p 97 log ; 


which is clearly bounded above by 1/2 for sufficiently large n. 


We now carry out the second part of the preceding informal argument, by showing 
that any Res(polylog(n)) refutation of PH P(G) contains a line for which the canonical 


matching tree has large depth. 


Lemma 3.38 Fizn > 0 and letc,h > 0 be arbitrary constants with h > 4. Seta =a4ef he, 
b =4ef Sla + 3) and M =dep nN+N/ logt! n. Fiz G € G” with minimum degree at least 
log°t' n and suppose that 7 is a PC(log®n) refutation of PHP(G) in matching normal 
form. Then h(Ta(f)) > log" n for some f Er. 


Proof of Lemma 3.38 Fix n,c,h as in the lemma and let a,b and m be as defined in 


the Lemma. Let G € G? and 7 a PC(log"n) refutation of PH P(G) in matching normal 
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form be given. Assume to the contrary that each f € m has h(Tg(f)) < log” n. We prove 
by induction on steps that Ta(f) has leaves all labelled 1 for each line f € m. Since 7 is a 
refutation, 0 € 7 and Tg(0) is a single leaf labelled 0, and thus we obtain a contradiction. 

The base case, where f = PF or f = HY for some i € L(G) or j € R(G), follows by 
inspection and the definition of Tg(f). For the inductive step, let f € m be derived from 
fi, fo € T and assume that Te( fı), Tal f2) have all leaves labelled 1. Fix p € Br(Ta(f)) 
and suppose for contradiction that the corresponding leaf has label 0. Since Ta(f) rep- 
resents f, we must have that p falsifies each term of f. Since h(Ta(f)), h(Ta(f)) and 
h(Ta(f2)) are all less than log” n and the degree of any vertex of G is at least log”** n, 
we may apply Lemma 3.27 twice to extend p to a matching restriction p' which extends 
some cı € Br(Tg¢(fi)) and o2 € Br(Ta(f2)). By hypothesis, each of 0, and oz satisfy 
some terms tı € fı and ta € fa respectively, hence p’ satisfies tı and to. Thus by Lemma 


3.28 there exists a term t € f satisfied by p”, contradicting p € Bro(Ta(f)). Therefore by 


induction T¿(0) has leaves all labelled 1, which by inspection of T¿(0) is a contradiction. 


We may now proceed to prove Theorem 3.25. 


Proof of Theorem 3.25 Let n > 0 be arbitrary. Fix constants c > 0 and h > 4 
and let d(n) =der log®"°*®) n. Set a = 8(hc+3) + 1 and let m =gep n+n/log"n. Let 
G ~ Brdn)nim). By Lemma 3.33, with probability at least 1/2 there exists a matching 


heti na. and any set 7 of at 


restriction p such that GJ, has minimum degree at least log 
most 3-2!°8"°" matching log” n-disjunctions over var( PH P(G)) has h(Tet,(ftp)) < log*n 
for all f € 7. We claim that in the event that such a restriction exists, there is no 
Res(log°n) refutation of PH P(G) with size at most 2!°8"°". Suppose for contradiction 
that 7 is a refutation of PH P(G) for such a graph G where |r| < 2los""n By Lemma 


3.30 there exists a PC(log®n) refutation a’ of size < 3|r| with every line in a matching 


log® n-disjunction. Let p be a restriction such that G], has minimum degree at least 
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logt! n and MT (fp) < log**n for all f € n’. By Proposition 3.26 and Lemma 3.29, 
Tr], is a PC(log®n) refutation of PH P(G},) where each line f’ has h(Tgp,(f")) < log” n. 


hc+1 


Since GT, has minimum degree at least log n, this contradicts Lemma 3.38 hence no 


such short Res(log® n) refutation can exist. Thus with probability at least 1/2, PH P(G) 


has no Res(log®n) refutation of size at most 2!°s"*”, 


3.3.5 Main result 


In this section we prove Theorem 3.21, i.e. that resolution is not quasi-automatizable 
unless Conjecture 3.19.1 is false. We first prove the stronger result for MRS, i.e. Theorem 


3.20. 


Proof of Theorem 3.20 Fix constants h, t and algorithm A which alos" "-approximates 
MRS in time 2!°8'", We show there exists constants a,b > 0 and algorithm M such that 
given G ~ Ba, 1080 njnln + n/ log? n'], if r@(1/2) < n/log* n then M always outputs 1, and 
that M outputs 0 with probability at least 1/2. 

Define k(n) =qef log’ n. Let a,b be positive constants; we will specify values for a 
and b later. Define s(n) =aef n+n/log*n and p(n) =der log?n/n. By Theorem 3.24 
and Lemma 3.22, there exists constants c,d such that Sp(PHP2"[k(n)]) < 28% for 
arbitrary n. We proceed to describe the algorithm M. Fix n > 0. For input G € Ga 
form the formula f = PH P(G)|k(n)]|, taking time O(poly(n) + (cea) )e which is at most 
quasipolynomial in n by definition of k(n). Set S =ge¢ k(n) - gles” and form instance 
y = (f,1°) of MRS. Then M simulates A on instance y and outputs whatever A outputs. 
Let d' be a constant such that |f| + S < 210847. Then A runs in time at most glo +n 
hence M takes time quasipolynomial in n. It suffices to show that M has the desired 


behaviour. 


Suppose first that r¢(1/2) < n/log"n. Then there exists P C L(G) of size r < 
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n/log* n with |[(P)| < r/2. Then PH P(G)|k(n)| contains, as a subformula, the unsat- 
isfiable formula PH P(G")|k(n)| for the subgraph G” induced by P. Then by definition 


of k(n) and Lemma 3.14 we have that 
Sp(PHP(G)[k(n)]) < 28°" < gos n, 


hence A((f,1°)) = 1. 

For the remaining case, set constant h’ so that h’ > max{4c,d + h} and set b =def 
8(h' + 3) and a = b+ 1. Consider G ~ By pn)[s(n)|. By Theorem 3.25, with probabil- 
ity at least 1/2, Sym (PHP(G)) > 208°”, and by Lemma 3.23, Sr(PHP(G)[k(n)]) > 
gles" /k(n) with probability at least 1/2. By definition of h’, we have that for n suffi- 
ciently large, with probability at least 1/2, 


d+h n 
y) 


Sa(PHP(G)|k(n)]) > 208" > 21 


hence A((f,1°)) = 0 with probability at least 1/2. 


Proof of Theorem 3.21 Follows immediately from Theorem 3.2 and Theorem 3.20. 


We conclude this sections with some remarks on the strength of Theorem 3.20. A 
particular requirement for (conditional) non-quasi-automatizability results identified by 
Alekhnovich and Razborov [5] is that the upper bound used in proving the reduction 
not hold for tree-like resolution, which is in fact quasi-automatizable by the width-based 
theorem prover algorithm of [24]. Recall that the width-based theorem prover, or WBTP, 
(see section 3.1.2) produces a refutation of an unsatsfiable formula 7, |r| = n in time 


O(wr(TF0)) 


n . However, this algorithm does not render Theorem 3.21 trivial, as shown by 


the following: 


Theorem 3.39 (Theorem 13 [15]) Letc be a sufficiently large constant so that PH P?"|log® n] 


r a > d a 
has a resolution refutation of size at most 28% for some constant d. There exists a 
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graph G € GY with m > 2n such that wa(PHP(G)llog"n] F 0) > Q(n/log"n) but 
Sr(PHP(G)[log"m] E 0) < 2Polvlogín). 


Theorem 3.39 also ensures that the branch-width based theorem prover of [6] (see 
section 3.1.2) cannot find a refutation of PH P(G)|log® n] in quasipolynomial time. 

Considering the hardness assumption of Conjecture 3.19.1, we have little evidence to- 
wards the plausibility that our contrived probabilistic promise problem is in fact hard for 
quasipolynomial time algorithms. We note that the naive algorithm which exhaustively 
checks |P(S)| for all |S| < n/log* n takes time roughly 


( i J (log! n)n/ leat m= (En 
n/log* n 


which is much larger than quasipolynomial in n. Conversely, our choice of set size param- 
eter n/log* n for YES-instances precludes the problem being solvable in polynomial time 
by checking if G contains a matching into R(G), since G € G? for m > n+n/log* n may 
contain a matching into R(G) as well as a set |S| < n/log*n with |[(S)| < |S|/2. This 
is essential as our proof of Theorem 3.25 essentially requires that a randomly chosen G 
contains a matching into R(G) with high probability. However, we are not able to prove 
that the distribution of Theorem 3.25 is not likely to be (n, 1)-expanding, in which case 
Theorem 2.21 implies a polynomial-time algorithm for the instance of Average MINcBCS 
used in our reduction. 

We finally note that the proof of Theorem 3.21 goes through with few changes if we 
omit the use of the k-disjunction form and simply use Res(log® n) in place of resolution, 


where c is any sufficiently large constant, giving us the following: 


Corollary 3.40 Suppose Conjecture 3.19.1 holds. Then for any sufficently large constant 


c, Res(log® n) is not quasi-automatizable. 
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3.4 Conclusions and open problems 


We have demonstrated several reductions, from computation problems related to bipartite 
expansion, to the (quasi-)automatizability of resolution. We show that the well-known 
and natural graph pigeonhole principle formulas allow a simple reduction from problems 
of distinguishing poor expanders from very good expanders. If strong evidence for the 
hardness of such expansion problems may be obtained, we hope this basic reduction 
may lead to a drastically simplified proof of non-automatizability results of [5] which 
further enrich the known connections between expansion and proof size. Furthermore, 
by applying the techniques of [15] to our basic reduction, we obtain a novel non-quasi- 
automatizability result for resolution. We also demonstrate the utility of an average-case 
hardness conjecture for expansion of random graphs, which may be of some independent 
interest. 

Given our lack of proofs or evidence for the hardness of the expansions problems used, 
we have not resolved the question of the (quasi-) automatizability of resolution, and so 
this problem is left open. Also left open are the problems of resolving the complexity of 
identifying bipartite expanders (see section 2.4). 

In reference to section 3.2, we ask if the lower bound for PH P(G) may be improved as 
a function of (boundary) expansion, so that the gap between the expansion size parameter 
for the ‘yes’ and ‘no’ instances might be reduced. Conversely, we also ask if there is 
some efficient construction such that the gap may be widened: for example, is there a 
transformation f(G) on bipartite graphs such if G is a poor expander, i.e. rG(1) < n/2, 
then ryg)(1) is much smaller than n/2, but if G is an (n/2, d) expander for d > 1 then 
so is f(G)? A major limitation to our results of section 3.2 is that the proof size in the 
YES instances is exponential in r¿(1), but for NO instances we require the expansion 
size parameter, and hence the width, to be linear in n to get exponential lower bounds. 
Can we prove that SR(PHP(G) E 0) > 2°waPAP(G)F0))? Such formulas are defined 


as smooth in [6], where it is shown that smooth formulas may not be suitable for non- 
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automatizability results. 


The results of section 3.3 present several open problems and areas for further work. 
The most pressing open problem is the veracity of Conjecture 3.19.1. Is there any strong 
evidence for or against the veracity of Conjecture 3.19.1, or even a similar conjecture with 
different parameters? We may also consider similar conjectures for the distribution Ba 
Can we prove even quasi-polynomial lower bounds which hold with high probability for 
PHP(G),G~ Bin for any degree parameter d? It is not immediately apparent how 
we might modify proof technique of Theorem 3.25, or the techniques of [102], for the 
distribution pos Moreover, can we do away with Conjecture 3.19.1 entirely, identifying 


a weakest possible assumption so that our quasi-polynomial lower bound of 3.25 holds? 


We also ask if our result can be improved, in the sense of using a more (even in- 
formally) plausible assumption, using the proof techniques of [96, 102]. In [102] an 
exponential lower bound is shown for Res(k) refutations of PHP(G) using expansion 
properties of random graphs from Bn pin) [2n] for p(n) = Q(Inn/n). However, the bound 
is only non-trivial for k < y/logn/loglogn. In [96] a similar bound is shown for k < 
O(log n/loglog n). Can these techniques be improved for k up to polylogn? In partic- 
ular, it seems desirable to obtain lower bounds for p(n) small enough so that random 


graphs are not likely to contain a matching into R(G). 


We also ask whether there exists other ‘natural’ formulas other than PHP(G) for 
which we may apply k-disjunction form technique of Theorem 3.20 to obtain similar 
or stronger results. In particular, [102] shows formulas which admit polynomial size 
Res(k + 1) refutations but require exponential size Res(k) refutations; perhaps there are 
natural graph-based formulas for which S(T + 0) depends crucially on k, in relation to 
some structural parameter. 

As discussed in the preceding chapters, a particular motivation for the study of au- 
tomatizability comes from the study of SAT algorithms related to weak proof systems 


such as resolution, for which superpolynomial lower bounds are known. For such algo- 
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rithms the problem of finding proofs efficiently in cases where short proofs exist remains 
of great practical interest. Several state-of-the-art SAT algorithms are based on a back- 
tracking search over all assignments, which, as previously noted, essentially produce 
regular resolution refutations on unsatisfiable instances. More recently, the technique of 
clause learning has been used to augment backtracking search algorithms for SAT with 
empirically successful results. The technique of clause learning developed from appli- 
cations in artificial intelligence, where backtracking search algorithms were augmented 
with the ability to remember “explanations” for backtracking points; this technique was 
then implemented for SAT instances where the explanations take the form of clauses 
(cf. [20,52,72]). As explored in [19, 20,64], there is practical and theoretical interest in 
characterizing clause learning (henceforth CL) as a propositional proof system for the 
theoretical analysis of its complexity. Given an appropriate definition of CL as a refu- 
tation system, we may ask if CL p-simulates general resolution, and if CL is (quasi-) 
automatizable, or weakly automatizable with respect to regular resolution. However, the 
problem of appropriately defining CL as a refutation system is an obstacle to work on 
these problems. It appears natural to define CL as a refinement of resolution. To this end 
Van Gelder [64] proposes the so-called pool resolution system. Roughly speaking, a pool 
resolution refutation is a resolution refutation which satisfies the following combinatorial 
property: there is a depth-first spanning tree of the underlying DAG which is regular in 
the sense of Definition 1.7. In [64] it is shown that pool resolution cannot be p-simulated 
by regular resolution, and that pool resolution is powerful enough to characterize most 
clause learning algorithms. Somewhat surprisingly, we are able to show that pool reso- 
lution is in fact not a propositional proof system in the sense of [47,48], unless P = NP; 
we present a proof of this result in Appendix A. Furthermore, in [17] it is shown that 
pool resolution is nearly as powerful as (general) resolution; consequently, the following 


is proven: 


Theorem 3.41 (Theorem 4.1 [17]) Jf pool resolution is automatizable then resolution 
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By Theorem 3.7 it then follows that pool resolution is not automatizable unless 
W[P] C FPR. We ask whether there is a simple combinatorial characterization of CL 
which is a propositional proof system in the sense of [47,48]. We also ask if there is 
such system which p-simulates resolution, or otherwise, if there is such a system which 


is automatizable, or weakly automatizable with respect to regular resolution. 


Appendix A 


Identifying depth-first regular 


refutations is NP-complete 


A.l Preliminaries 


We first define depth-first regular resolution (henceforth DFREG) as a system of refuta- 
tions, based on the pool resolution system defined by Van Gelder [64]. For this purpose 
it is convenient to define the resolution rule as a function mapping a pair of clauses and 


a variable to a single clause: 


Definition A.1 (Res function) Define the function Res as follows: Res takes as input 


two clauses A,B and a variable x, and outputs a single clause according to the following: 


e Ifx € A and =x € B, then Res(A, B,x) = A V B' where A’ and B' are exactly A 


and B respectively, except that any mention of x has been removed from them. 


e If, on the other hand, x ¢ A, then Res(A, B,x) = A. We call this the degenerate 


resolution rule. 


Definition A.2 (Depth-first regular resolution) Let A be a set of clauses and C a 


clause. A depth-first regular derivation m of C from A is a DAG with a single source node 
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called the root. The root of n is labelled with clause C and each sink node is labelled with 
a clause from A. Each non-sink node u has out-degree 2 and is labelled with Res(A, B, x) 
where x is a variable A and B are the clauses labelling the two children of u. For each 
non-sink node u with children v,w, if u is labelled with Res(A, B,x) where A is the label 
of v and B the label of w, then the edges from u are labelled with literals as follows: if 
a literal x€ appears in A, then the edge (u,v) is labelled with x€ and (u,w) is labelled 
with a+~©. Otherwise, (u,v) and (u, w) are arbitrarily labelled with distinct literals of x. 
Furthermore, n must be depth-first regular in the following sense: there exists a depth- 
first spanning tree T ofr such that, on every path p in T, the underlying variables of all 
literals appearing as edge-labels in p are distinct. 

A depth-first regular refutation of a set A of clauses is a depth-first regular derivation 


of the empty clause from A. 


It is not hard to show that any DFREG refutation may be converted to a standard 
resolution refutation, and that the DAG representation of a regular resolution refutation 
is also a DFREG refutation, hence DFREG is sound and complete. It is known that 
regular resolution cannot p-simulate DFREG ( [64], but it is open whether DFREG p- 
simulates (general) resolution. In what follows, we show the following somewhat suprising 


result, introduced in Section 3.4: 


Theorem A.1 Depth-first regular resolution is not a propositional proof system in the 


sense of [48], unless P = NP. 


A.2 Main result 


We now prove Theorem A.1 by reduction from the well-known exact cover by 3-sets 


problem, which we state below for completeness: 


Definition A.3 (Exact 3-Cover) Fix n and let X be a set of size 3n. Let C be a finite 


collection of 3-element subsets of X. A cover of X is a subcollection C’ C C such that 


APPENDIX A. IDENTIFYING DEPTH-FIRST REGULAR REFUTATIONS IS NP-COMPLETE90 


each x € X appears in some c € C”. A cover C’ is exact if each x € X appears in at 
most one c EC”, 

The EXACT 3-COVER problem, abbreviated X3C, is defined as follows: given X of 
size 3m and collection C of n 3-element subsets of X, determine if C contains an exact 


3-cover of X. 


It is well-known that X3C is NP-complete, as proven in [77]. Thus to prove Theorem 
A.1, we reduce X3C to the problem of, given a resolution DAG r~, formula A and clause 


C, determining if 7 is a DFREG derivation of C from A. 


Proof of Theorem A.1 Consider an instance y = (X,C) of X3C. We construct a 
formula y and refutation (DAG) a of y so that 7 is a DFREG refutation if and only if 
y € X3C. We first state the constructed formula y as follows, where i ranges over [n] 
and j ranges over [3m], where n = |C| and 3m = |X|. For each C} € C, arbitrarily select 


some 2; such that x;, € C}. 


ke[n] ke[n] 
Zi =2i 
Z; =z V bj 
Xi =2; 


X; =T; V Dti, V DTi, V “4; V dj, 
where {£i , Ziz, Zig} = C; 

j =j V aC; 

L; =Cj V ad; 


Note that our refutation m includes resolution on variables not appearing in any of 


these above clauses. Thus we assume that any variable x resolved upon in 7 which does 
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not appear in the above set of clauses is contained in a unit clause {x}. We now describe 
the refutation m. In what follows, if the degenerate rule is applied without specifying 
the variable to be resolved upon, the variable resolved upon is implicitly a new variable 
distinct to that application of the rule. 

For each j € [n], derive the unit clause =b; as follows: take initial clauses 2;,, Zi, Ziz 
such that Lea tartot = Cj. Derive z; and z; from 2;,,2i, and 2;,, Zi, respectively, 
using the degenerate rule resolving on variable cj. Derive 2;, from 2;, and z; from the 
preceding steps by the degenerate rule. Resolve z;, from the previous step with axiom 
Z; on variable z;, to obtain bj. 


Define clauses C (7), C’(j) as 


CU) = Nav N b for. <j <n, 


kelj] ke[j] 
C= A av A bk, for Ose E 
kelj] keļj—1] 


where C(0) is the empty clause. We inductively derive C(n),C(n — 1) down to C(0) 
using the unit clauses =b,, j € [n] from the previous step. First note that C(n) is 
exactly the initial clause C,. Suppose then we have derived C(j + 1) for 7 < n; we 
show how to derive C(j). Let {£i , Ziz, £i} = Cj. Resolve C(j + 1) with 7b;4, from 


the previous step to obtain C’(j + 1). Resolve C'(j + 1) with X},, on ajy1 to derive 


X3 = C(j) V 74a, V ati V aTi; V d;¡41. Resolve X3 with initial clause x;, to obtain 
Xə = C(j) V 225, V “ti V dj41. Resolve Xə with initial clause x;, to obtain Xy = 
C(j) V =z; V dj41, and resolve Xy with x; to obtain R = C(j) V dj4i. 

We then derive C(j) V =dj+1 as follows. Resolve C'(j + 1) with £L;+1 to obtain 
L' = C(j) V 7¢;41. Apply the degenerate rule once to derive L’ from L’ and X3, and 
twice again to derive L’ from L’ and X; and from L’ and X4. Resolve L with L; +1 tO 
obtain L = C(j) V 2d;+1. Resolving with R then yields C(j). We refer to the above 
derivation of C(j) from C(j + 1) as the group component for Cj. Figure A.1 illustrates 


the group component for C; in 7. 
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Figure A.1: Derivation of C(j) from C(j + 1). 


We now show that the constructed DAG ~ is a DFREG refutation if and only if 
(X,C) is a yes-instance of X3C. We assume that each z; € X appears in some C; € C, 
since otherwise (X, C) is trivially a no-instance of X3C. Thus every initial clause x; and 
zi, 1 € [Bm], appears in 7. 

Suppose first that there exists an exact 3-cover C’ C C for X. We give a regular 
depth-first traversal of 7. The traversal begins at the root C(0). Fix j € [n] and suppose 
the traversal is at C(j — 1). We use the naming of clauses in the group component for 
C; from the above figure. 

If C; € C’, then proceed first to R, then visit x; before X, and so on until reaching 
Xs. Visit X},,, then proceed to C’(j +1). Proceed to =b;+1; the order of traversal of 
the subgraph rooted at —b;+, is irrelevant. Continue to C(j + 1). Upon backtracking 
to C(j), visit the remaining unvisited nodes in the group component for C} in arbitrary 
order. 

If C; E C”, then proceed directly down to C(j + 1). Upon backtracking to C’(j + 1), 


visit =b;+1, then backtrack to L’ and visit L;y+1. Let £/, L3, Ly denote the instances of L’ 


APPENDIX A. IDENTIFYING DEPTH-FIRST REGULAR REFUTATIONS IS NP-COMPLETE93 


in the group component for C; which are adjacent to X1, X2, X3 respectively. From L% 


visit X; and X;,,. From L; visit X2 and from Lj visit Xy. From L visit Li, ,. Finally, 


j+: 
upon backtracking to C (j), visit R. 

To see that our traversal forms a spanning tree for 7, note that each x; and z; is 
visited before visiting C(j + 1) where x; € Cj+41. Thus when the traversal reaches C (n), 
each x; and z; has been visited. Thus when backtracking to C(j+1) such that Cj+1 ¢ C’, 
each of £i, Ziz, Zig, Which are not explicitly visited in our traversal, must already have 
been visited. All other vertices are explicitly visited in our definition of the traversal, 


hence the traversal forms a spanning tree. 


Given a point in the traversal (i.e. a vertex of m at the moment it is first visited), 
we say that we have seen a variable x if the path to m in the DFST contains an edge 
labelled with a literal of x. To see that the traversal is regular, we observe that there are 
only two cases in which an irregularity can occur: if at C(j) we have seen the variable 
x, but x; € Cjy1 € C’, or if at =b;,1 we have seen the variable c;y1 but not visited some 
initial clause z;, z; € Cj41. The latter case cannot occur, since if at =b;,/ we have seen 
Cj+1, then it must be that Cj41 ¢ C”, so the traversal has already visited C(n) and hence 
visited each initial clause z; for x; € X. Consider then the former case, wherein at C(j), 
Cj+1 € C” and we have seen the variable x; for some x; € Cj41. By definition, in this 
case it must be that x; € Cp for some k € |j + 1], contradicting C” being an exact cover, 


so this case also cannot occur. 


We now show that if 7 is a DFREG refutation, then (X, C) is a yes-instance of X3C. 
Suppose that T is a regular DFST for m. We construct an exact 3-cover C” for X from 
C as follows: for any j € [n], C; € C” if and only if for some i € [3m] an axiom Z; is 
reached in T by an edge labelled from {c,;,-c;}. It is easy to see that C” forms a cover: 
for each i € [3m], the axiom Z; appears in T and hence x; € C} for some C; € C’. We 
now show that C” is exact. Suppose for sake of contradiction that there exists i € [3m], 


and 1 < j < k < n such that x; € Cj, x; € Ck and both Cj and Cp are in C”. By 
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definition of C”, there exists 2;, i € [3m] such that axioms Z;, and Z;, are reached in T 
by edges labelled with literals of cj, Cg respectively. Then it must be that both C’(j) and 
C"(k) are reached in T by paths on which the variables c; and cz do not appear. Consider 
then the path p to C(k — 1) in T; it must be that each variable x; for xy € C} appears 
on p. But since cz, does not appear on the path to C’(k), it must be that each variable 
xy for x € Cj UC, appears on the path to C’(k). Then the variable x; appears twice 


along the path to C’(k), contradicting T being regular. Thus C” is an exact 3-cover for 


X, completing the proof. 


Appendix B 


Omitted proofs 


B.1 Chapter 2 


B.1.1 Section 2.3 


Proof of Theorem 2.11 We show that, for arbitrary integer 6 > 0, that the NP- 
complete problem CLIQUE is polytime reducible to both 4BCS and BMCS. Recall 
that CLIQUE is the problem of, given a graph G and integer k < |V(G)|, determining 
whether G contains a k-clique. Fix integer ô > 0 and instance (G, k} of CLIQUE where 
k < |V(G)| =def n. Assume without loss of generality that m =4ef |E| is at least (5) 
(since otherwise G trivially has no k-clique), and that k > 2. Set K =der ô (5) — 1. Define 


bipartite graph H as follows: 
L(H) =V x [n] 
R(H) = {vee E(G), € [FU du; : j EKO l iee € [n= Eel} 
veV 
We define E(H) by defining the neighborhood of arbitrary (v, 4} € L(H): 


[((u, £)) = {va : v € e,i € [6] U {uy : j € [K]} U (e; : 4 € [6], j € [n— Te@)|]} 
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Then |L(H)| = n?, and 


|R(H)| = K + ô[m +n? - X` |Pe(v)|] < ón?, 
vEV 


since K < ôm — 1 and the sum of the degrees of each vertex is 2m. Clearly H is 
constructible in time poly(n). We complete the reduction by forming instance (H, kn). 
We now prove some essential claims about H. For any S C L(H) define supp(S) to be 
the set of v € V(G) appearing in any s € S. Define edges(S) as the set of (u,v) € E(G) 


such that both u and v are in supp(S). 


Claim B.0.1 Let S C L(H) be of size at most kn. Then 


IPn(S)| =8> [nlsupptS! + (5) —ledues(5)1 -1 


Proof of Claim B.0.1 Fix S as in the claim. By definition of H we have that 


IP(S)| =K+6-|{e € E(G) : en supp(S) 40H + XO S(n—|T(w)). 


vEsupp(S) 


It is easily shown that the sum of the degrees of u € supp(S) is exactly 2|edges(S)| plus 
the number of edges in E(G) incident to exactly one v € supp(S), and hence exactly 


{e € E(G) : e N supp(S) 4 O}| + ledges(S)|. Therefore, 


T(S)| = K + ón|supp(S)| — dledges(S)|. 


Since K = 5 (5) — 1, Claim B.0.1 is completely proven. 


Claim B.0.2 Suppose S C L(H) where |S| < kn and |[(S)| < 6|S|. Then both 
|supp(S)| = k, and |S| = kn. 


Proof of Claim B.0.2 Fix S as in the claim. We first show that |supp(S)| = k by 
cases. Suppose that |supp(S)| = s < k. Clearly then |edges(S)| < ee and by Claim 


B.0.1 we have 


IP(S)| > (ns +1) — 1 > ns + (6 — 1) > dns 
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But clearly |S| < ns as there are n vertices in H for each u € supp(S), so we have 


D(S)| > 4|S|, a contradiction. Suppose then that |supp(S)| = s > k. Then by Claim 


rine ja) 


since s > k and |edges(S)| < (5). It is easily shown that, for x < n the function na — (5) 


B.0.1 we have that 


is strictly increasing. Then we have 
T(S)| > ónk — 1 > 6] S|, 


also a contradiction. We then conclude that |supp(S)| = k. To show |S| = kn, suppose 
for contradiction that |S| < kn. Since |supp(S)| = k we have that |edges(S)| < (5). 


Then by our assumptions on S and Claim B.0.1 we have 


5(nk — 1) > 6|S| > |P(S)| > ónk — 1 


Which implies that 6 < 1, a contradiction. Claim B.0.2 is completely proven. 


We now prove our reduction is correct for both 6BCS. Suppose first that G contains 
a k-clique C. Let S =gef C x [n] so that |S| = kn, |supp(S)| = k and ledges(S)| = (Gy: 
Then by Claim B.0.1, 


IP (S)| = okn — 1 < dkn, 


hence (H, kn) € BCS. Suppose then that there exists S C L(H) where |S| < kn and 
I(S)| < 815]. By Claim B.0.2, we have that |supp(S)| = k. Then by Claim B.0.1, we 


have 
k 
dkn > 6|S| > JS (5) = edges(5)| 


which implies that ledges(S)| > els hence supp(S) is a k-clique in G. Thus G has a 


k-clique if and only if (H,nk) € 6BCS. Furthermore, by Claim B.0.2, we have that every 
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S C L(A) of size strictly less than kn must have |['(S)| > 6|S|, so (H,nk) € 9BCS if 


and only if (H,nk) € 9BMCS. This completes the proof of Theorem 2.11. 


Proof of Theorem 2.13 By reduction from CLIQUE as in the proof of Theorem 2.11. 
Fix ô € (0,1) and instance G,k to CLIQUE. Set n =qgef |V(G)| and m =aer |E(G)]. 
Assume that n is sufficiently large so that n/d is an integer, and assume without loss of 
generality that m > (5) and k > 2. Set K =gef (5) — 1. Define bipartite graph H as 


follows: 


L(H) = V x [n/6] 


R(H) = (ve : e € E(G)}U fu, : je [K]}U U (3 : 3 € [rn — Lol) |} 
veV 


We define E(H) by defining the neighborhood of arbitrary (v, £} € L(H) as follows: 
P((v, &) = (ue: 0 € ep U {uy = 9 6 [K]} Ufa} = 3 € n— [Polo)l) 


Observe that |L(H)| = n?/8. Clearly H is constructible in time polynomial in n. We 
proceed to prove some essential facts about H. For any S C L(H) define supp(S) to be 
the set of v € V(G) appearing in any s € S. Define edges(S) as the set of (u,v) € E(G) 


such that both u and v are in supp(S). 
Claim B.0.3 For any S € L(H) where |S| < kn/ô, 


[P(S)| = K + n|supp(S)| — ledges(S)| 


Proof of Claim B.0.3 Fix S as in the claim. By definition of H we have 


IN(S)|=K+ Y (n—|Pa(v)|) + fe € E(G) : en supp(S) + 0). 


vEsupp(S) 


As in the proof of Claim B.0.1, it is easily shown that ` (s) Pa(v)| is exactly |{e € 


vVESUPP 


E(G) : eNsupp(S) 4 0H +ledges(S)|, hence |['(S)| is exactly K+n|supp(S)|—ledges(S)| 
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as desired. This completes the proof of Claim B.0.3. 


Claim B.0.4 For any S € L(A) with |S| < kn/6, if |(1(S)| < 6|S| then |supp(S)| = k. 


Proof of Claim B.0.4 Fix S as in the claim and assume that |D(S)| < 6|S|. Let 
Isupp(S)| = s. Suppose for sake of contradiction that s < k. Then ledges(S)| < K and 
by Claim B.0.3 we have that |D(S)| > nk > 6|S|, a contradiction. Suppose then, again 


for sake of contradiction, that s > k. By Claim B.0.3 we have |['(S)| > K+nk-— (3). But 


x 


for any z < n the function na — (5) is strictly increasing, hence |['(S)| > nk — 1 > 9/51, 


contradiction. Claim B.0.4 is completely proven. 


It suffices to show that G has a k-clique if and only if (H, kn/9) is in BCS. Suppose 
first that G contains a k-clique C. Set S = C x [n/0] so that |S| = kn/ó, |supp(S)| = k 
and |edges(S)| = eae Then by Claim B.0.3 we have |[(S)| = K + nk — K —1, hence 
IT(S)| < nk = ô|S| as desired. Suppose then that there exists S C L(H) where 
|S| < kn/d and |[(S)| < 6|S|. By Claim B.0.4 we have that |supp(S)| = k. Then 


by Claim B.0.3, nk > |[(S)| = K + nk — |edges(S)|, hence |edges(S)| > K +1 = (5) 


and so supp(S) is a k-clique in G. Theorem 2.13 is completely proven. 


B.2 Chapter 3 


B.2.1 Section 3.2 


Proof of Proposition 3.19 Let D be an algorithm which quasi-automatizes MRS. 


I71+Sr(7F0)) and produces 


Let a,b be constants such that D on input 7 runs in time 2!°8"( 
a refutation of size at most 2!°8’ Sa(*-9) Fix constants co, €o, 6 and d as in the proposition, 


with d large enough so that dd > 1. Let c be a constant so that |PHP(G)| < cn? for 
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G € G" , and sufficiently large n, and let c’ be the hidden constant of Lemma 3.14 such 
that Sp(PHP(G) E 0) < 2°"e™ for sufficiently large n. Set ho =aer C- €o and let hy be 


a constant such that 
hı log” n > ho log n + 3logn + loge 


for sufficiently large n. Define algorithm A as follows: given G € G/’_,, compute T =def 
PHP(G). Simulate D on input r for N =aep 21807" steps, and output 1 iff D 


e b . 
holog®n)” Clearly A runs in 


halts and outputs a refutation of size at most © =def 2 
quasipolynomial time, so it suffices to show that A is correct. 

Suppose first that r¢(1) < eolog? n. Then by Lemma 3.14 and our choice of e”, 
Sr(PHP(G)+ 0) < 2%0!8", Then by our choice of parameters, clearly D halts within 
the time bound and outputs a refutation of size at most S. 


Suppose then that G is an (€,,/n log” n, 9d)-expander for some constants €1, C1 to be 


specified later. Since 6d > 1, by Lemma 3.15 we have 
APEPI EO) S 20 yr Oakes 


for some constant k and sufficiently large n. Let s =4ef max{a,b}. Set constants 


Cı =def 2(Co + s) and e, = \/hi/(kd) + 1. Then by our choice of c,,€,, we have that 


Sr(PHP(G) F 0) is at least max{ N, S} and so D outputs 0. 


B.2.2 Section 3.3 


Proof of Lemma 3.28 Let 7 be a PC(k) refutation of PH P(G) and let f be a non- 
axiom line in 7 inferred from previous lines f1, f2. We proceed by cases over the rules 
of PC(k). For subsumption the result is trivial. We consider each rule of 2 premises in 
turn. Let t¡,t2 be arbitrary terms of fı, fo respectively such that it is not the case that 


ty la to. 
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Suppose f = Č V t is derived by AND-introduction from C V zı and C V (t \ {x1}). 
If t, or tg is in C then we are done. Otherwise, suppose p is a matching restriction 
satisfying both tı = xı and tz = t’ \ {x}; clearly then p satisfies t € f. Suppose then 


that f = C V D is derived by the cut rule from C V A jTi and D V Viet] gi Where each 


1€lj 


gi La zi. If either tı € C or tg € D then we are done. Otherwise, tı = Niecy xi and 


to = gi for some i € |j]; then tı La tz which by assumption cannot occur. 


Proof of Lemma 3.28 Fix a PC(k) refutation 7 of PH P(G) for some bipartite G and 
k > 1. Let p be a matching restriction over var( PH P(G)). We show by induction over 
steps that mr], is a PC(k) refutation of Gl). 

For the base case, by inspection PH P(G)}, is exactly PHP(G!,). For the inductive 
step, let f be a non-axiom line of m. If f is derived by subsumption then the claim is 
trivial. If flp = 1 then trivially we are done, so assume hereafter that flp 4 1. Suppose 
that f = C V t is derived by AND-introduction from CV x and CV (t\ {x}). If p falsifies 
x or t\ {x} then fl, = Cl, is derivable by a single subsumption step. Suppose then that 
alo (t\ Lap), E {0,1}. Then tlp = A (tA [x))), hence fl, follows by AND-introduction 
as in T. 

Suppose then that f = C V D is derived by cut rule from C V Niecy x; and DV Vicy Gi 
where each g; Lg xi. If p falsifies Niel j] Zi Or Viet g; then f follows by subsumption from 
either CT, or DÌ, respectively. It suffices to show that fl, follows by the cut rule. Let 
= Nici xi. If p satisfies some g;, then gi = Tay and x; = xa’b for some a’ Æ a and hence 
x; is not in t{,. Thus for each x; € tl, the term corresponding term g;/, is not 0 or 1, 
and clearly x; La gip, hence fl, is derivable by the cut rule. 

Thus by induction each line 7], not set to 1 is either an axiom of PH P(G{,) or follows 


by a PC(k) rule. Since 0 € 7 clearly 0 € Tr|,, so af, is a PC(k) refutation of PH P(G],). 
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Proof of Lemma 3.30 Fix G, 7 and k as in the lemma. Let 7’ be the PC(k) refuta- 
tion obtained by applying the monotone transformation to each line of 7. We show by 
induction over inferences in 7’ that we can derive Ma(f) for each line f € 7’ in at most 
three times as many inferences. Assume without loss of generality that each positive 
literal x;; has at least one positive literal £ such that £ Lg Ziz. 

For the base case, by inspection Ate = Mg( Ate) for each A € PHP(G). Let f be 
a line of rm” derived from fi, fip and assume that we have derived Magl fi) and Mal fia). 


We proceed by cases for the rule used to derive f. 


1. If f = CV D was derived by subsumption from C then clearly Ma(C V D) follows 


by subsumption from Mg(C). 


2. Suppose that f = C V D was derived by cut from DV Aye; Li and C V V je) Ci for 


vel; 


some j < k and where each 4; Lg C;. Let t denote the term Ncp 4. If t is not a 


ics] 
matching then Mg(DVt) = Ma(D) and by hypothesis we may derive Ma(CV D) 
in a single subsumption step. If t is a matching then f is derived by the cut rule, 


replacing C, D with Ma(C) and Ma(D) respectively. 


3. Suppose f = CV Nic; l; was derived by AND-introduction from Č V ¢; and C V 
\j=2.; li Let t denote the term Mg(A,<; li- Ift is a matching term then Ma(f) is 
also derived by AND-introduction replacing C with Mg(C). Assume then that t is 


not a matching. Ift = A L; is not a matching term then Ma(C Vt’) = Me¢(C) 


i=2..j 
and so Mg(CVt) = Ma(C) follows by subsumption. If t’ is a matching but t is not, 
then /; shares a common endpoint with some @;, 2 <i < j. Let l1 = xq for some 
i,j. If £a E t then Mo(f) is derived in a single subsumption from MG(D v 1”). 
If t contains some zas, then we derive by subsumption f’ = Mce V Zap V Vi. L; 
where each / Lg li for 2 <i < j. Cutting f’ with Ma(D Vt') then yields Mc(f). 


If t’ contains some Tay, then cut Ma(C) V £a with the monotone hole axiom HE 


to obtain f! = Ma(C) v C’ where C’ = Vaer(w),0ta Lay. Then C’ La Zay and as 
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in the previous case we use subsumption on f’ then cut with Ma(D Vt’) to derive 


Mal(C v D). 


Thus by induction we obtain Mg(0) = 0. In each inductive step we use at most 3 infer- 


ences and thus the resulting refutation has size at most 3|7”|. 


In the following proof of Lemma 3.31 we will use the following combinatorial lemma 


proven in [37]: 


Lemma B.1 (Lemma 10 [37]) Fix integers n,d,k > 0, r € [n] ands > 0. Let H € 
Gre! and let f be a matching k-disjunction over var(PHP(H)). Let R be the set of 
matching restrictions p € MH) such that h(Tm,,(f1,)) > s. Then there is an injection 
from R to the set 
LJ M,4;(H) x Clk, j) x [ad]? 
s/2<j<s 
where each C(k, j) is a set of size at most (k/n 2)’, and the first component of the image 


ofp ER is an extension of p. 


Proof of Lemma 3.31 Fix parameters n, a,c, s as in the lemma, and let b,r and A be 
as defined in the lemma. Fix H € G”*! satisfying conditions 1,2 and 3 of the lemma, and 
let f be an arbitrary matching log" n-disjunction over var(PH P(H)). Let B denote the 
event p € M(H) and C the event |M(H)|/|Ms5(H)|. Let A denote the “bad” event 


that Tay, (flp) has height at least s. We first write 


paro o] = Pel BIOI 
Pr[A, B,C] (B.1) 

T (1 — 2-108***m) Pr[C] 
Pr[A, B,C] (B.2) 


~ (1-05 Pm) (1—3/m) 
< Pr[A, B,C]/(1— 4/n), 
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where inequality (B.1) follows since p is uniformly distributed over Mhotes(p)(H), and 
(B.2) follows by assumption on H and the uniformity of S over R(G). It suffices to 
bound Pr[A, B,C]. Let R =a, AN BOC. By Lemma B.1, any restriction p € R may 


be uniquely mapped to some (p’, a, 3) € B; for some s/2 < j < s, where 
Bj =aes Mr+j(H) x C(k, j) x [A}, |C(k, 5)| < (k/n 2)’, 


and p’ is an extension of p. We bound the probability of such p € R associated with 
arbitrary s/2 < j < s and sum over all such j to obtain the desired bound. 
Fix j such that s/2 < j < s. We bound the total probability of p € R which map to 
(p', a, 3) € B; by comparing the probability of p under distribution M,(H) and the prob- 
' 


ability of p' under distribution M,.,;(H), which is sufficient since es Pini Sp = 
3. o€M,+5(H) P r+j(H) 


0] < 1. Let S’ =a4ef holes(p'), so that S C 5”. By definition, we have that 


Prom my lo =p] (05) Ms (H)| 
Promo =p] (MIMAED)]| ` di 


We then bound the ratio |Myg(H)|/|Ms(H)| as follows: by definition, any o” € My(H) 
is an extension of some unique o € Ms(H). In case o € M(H), which occurs for at 
least a (1 — 27 18%) fraction of Ms(H), then Hl, has maximum degree A and there 
are at most A’ matchings extending o in Ms(H). Otherwise, H|, has maximum degree 
6log” n by assumption on H and there are at most (6log”n)? matchings extending o in 
Ms (H). Thus for sufficiently large n (B.3) is bounded above by 


ie $ a+1 . = j a+l ? j 
e [AS + 2118 (6log? my!] < oa 2) i ÓN (SE ») | 


2 (o) h eft. g-ale log ny (B.4) 
rlog/8 y 
An Y | j f 
< SIT 91-A/(6log n) loen 7bA/ (log il 
(<=) (logn) 
(B.5) 
12 log?/8 y y? 
< ( jog (B.6) 
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Inequality (B.4) follows from j < s and definitions of A and r. Inequality (B.5) follows 
from s < 6log*n = A/log*n. Inequality (B.6) follows as 12(7b/8)loglogn < logn and 
n/r = 1/(1 — log”/*n) = 1 for sufficiently large n. Thus for n sufficiently large, the 


probability of a “bad” p € R associated with 7 is bounded by 
(12 log 3"/* n log’ n/ In 2) (6 log’/* ny? < (20 log? 4 n) (6 log’/* ny. 
Summing over s/2 < j < s, we have 


` (20 log? 8/4 ny (6 log? ny = (6log?/* n)* ` (20 log py 


s/2<j<s s/2<j<s 
< (6log*n -2010g Mp) PP Y" (1/3) (B.7) 
s/2<j<s 
< (720 log? py. (B.8) 


Inequality (B.7) follows from assumption c — 3b/4 < 0, so that 20 logt n < 1/3 for 
sufficiently large n, and standard bounds on geometric series. Inequality (B.7) follows 
from standard bounds on geometric series with constant ratio. Thus Pr[A|B,C] is at 


most (720 log??? n)s/2/(1 — 4/n) < 2(720log*’/? n)*/? for sufficiently large n. 
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